Page MenuHomePhabricator

Metadata improvement campaign
Open, LowPublic

Description

I've been thinking. Even now that we don't have reliable information just yet, we could probably make some sort of 'reliability' assessment of the data that we scraped.

The goal is to improve the quality of the data presented by MediaViewer, it would spur the community to action on these issues (which are now only visible as 'failures' in media viewer, and not as 'editor feedback' in file description pages.

On the file description, we could show a bar to registered users with more than 100 edits saying: "This image has 5% machine readable data. If you would care to help improve this, please join the cleanup campaign."

The user could click it and you would see a list of "things that are missing or unknown, or badly formatted". The campaign page would list tools like the "Add {{Information}}" gadget that is available to users and similar data.

We could also give 'bonus' points for usage of {{data}}, {{author}} or similar templates that are able to provide more specific and more semantic data to us when moving stuff to a wikidata layer.

Perhaps Magnus Manske and Riilke would be able to assist in setting something like that up with the community. Later we can retool all that into a gadget that works for the migration to wikidata layer and people would already be used to this workflow (and we can perhaps provide aggregate metrics by moving everything to a better layer at that point in time).


Version: master
Severity: normal
See Also:
https://bugzilla.wikimedia.org/show_bug.cgi?id=69557

Details

Reference
bz69534

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 3:29 AM
bzimport added a project: MediaViewer.
bzimport set Reference to bz69534.
bzimport added a subscriber: Unknown Object (MLST).
TheDJ created this task.Aug 14 2014, 12:54 PM
Tgr added a comment.Aug 14 2014, 6:50 PM

Currently the only thing we could easily check is whether there is an Information template present and whether there is a license template present. (And coordinates, but we can't tell whether a file missing a coordinate template should have one.)

TheDJ added a comment.Aug 27 2014, 9:54 AM

https://commons.wikimedia.org/wiki/User:TheDJ/datacheck.js

JS evaluator for this under development (will take me a few weeks to finish probably, with my time constraints).

Needs a project page on Commons etc..

Gilles triaged this task as Low priority.Nov 24 2014, 1:58 PM
Gilles added a subscriber: Gilles.

Superseded by the Metadata cleanup drive?

Tgr added a subscriber: gpaumier.Nov 26 2014, 12:36 AM
Tgr added subscribers: Aklapper, MingleTerminator.

From the duplicate:

  • When fundamental data (author, source, license) is missing or not machine-parsable, show a call to action explaining how to fix it, and a link through which the user can fix it right now
  • (optionally) limit the call to action to users who are experienced enough (logged-in, >N image edits?)
  • (optionally) make use of Magnus' add-information tool
Tgr added a comment.Dec 11 2014, 3:09 AM

Probably superseded by the file metadata cleanup drive, yeah.

Restricted Application added a subscriber: Matanya. · View Herald TranscriptAug 14 2015, 8:48 PM
Jdforrester-WMF moved this task from Untriaged to Backlog on the Multimedia board.Sep 4 2015, 6:09 PM

Mass-removing the Multimedia tag from MediaViewer tasks, as this is now being worked on by the Reading department, not Editing's Multimedia team.

Restricted Application added a subscriber: PokestarFan. · View Herald TranscriptAug 7 2017, 5:47 PM
gpaumier removed a subscriber: gpaumier.Jul 18 2018, 6:07 PM
Tgr removed a subscriber: Tgr.Jul 9 2019, 6:04 PM
Restricted Application added a subscriber: Liuxinyu970226. · View Herald TranscriptJul 9 2019, 6:04 PM