Page MenuHomePhabricator

[XL] Post-upload job to detect logos
Closed, ResolvedPublic

Description

Create a post-upload job to rescale uploaded images and test them against the LiftWing logo detection model

If they're logos then use wbsetclaim to add instance of:logo with the qualifier determination method:machine learning

We'll probably also need to check WikibaseMediaInfo to make sure instance of gets written to the search index

Event Timeline

Note that this should NOT be part of UploadWizard, because we want the detection to happen for all uploads. Note that non-WMF installs won't have access to the logo detection endpoint

So we think we'll proceed like this for now (simplest approach, because it's a PoC*)

  • update the UW mediadetection api to accept filename as well as filekey
  • have a script running continuously somewhere (on toolforge?) listening to MW events, that calls that api whenever a file is uploaded and writes (structured or template) data based on the result

*for a non-PoC version we think we probably ought to have a non-MW api endpoint (written in node?) where you can send it an image/thumbnail and get back a likelihood-of-deletion score (plus other information)

AUgolnikova-WMF renamed this task from Post-upload job to detect logos to [XL] Post-upload job to detect logos.Jul 4 2024, 9:55 AM
mfossati changed the task status from Open to In Progress.Jul 10 2024, 4:11 PM
mfossati claimed this task.

Unblocked: directly load the logo detection model within the same code.

Blocked by T370861: [L] Refine the logo detection model: a monitoring job is currently running on Toolforge via https://toolsadmin.wikimedia.org/tools/id/gogologo, but we'll need to switch to the refined model.

Plugged secondary models from T370861: [L] Refine the logo detection model into the monitoring job.
Closing, follow-up expected after T370137: Set up a bot account on Commons to add logo-detection statements 's resolution.