As Wikidata PMs, we want to better understand content quality in Wikidata to inform our product strategy.
What we already have:
- ORES quality dataset
- Wikidata ORES Quality Report, 2020
- re-use statistics and number of items per ORES class
Current iteration:
Goal
- Better understand the current distribution of ORES quality scores across Wikidata's classes.
Steps
- T285458: Generate inputs for the first sensemaking session about ORES quality score distributions across Wikidata classes
- joint sensemaking session with PMs about possible ways to simplify the results in a meaningful way (e.g. clusters of classes)
- visualization of relevant findings for strategic analytics report
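
A possible starting point for the distribution inputs, as a minimal sketch only: it assumes the ORES dataset can be exported as (class QID, quality label) pairs and that labels run from A (best) to E (worst). The function name `quality_distribution` and the sample rows are hypothetical and purely illustrative; they are not real results.

```python
from collections import Counter, defaultdict

# Assumption: ORES item-quality labels, ordered from best (A) to worst (E).
QUALITY_LABELS = ["A", "B", "C", "D", "E"]


def quality_distribution(rows):
    """Return {class_qid: {label: share}} from (class_qid, label) pairs."""
    counts = defaultdict(Counter)
    for class_qid, label in rows:
        counts[class_qid][label] += 1

    distribution = {}
    for class_qid, counter in counts.items():
        total = sum(counter.values())
        # Share of each label within the class; labels never seen get 0.0.
        distribution[class_qid] = {
            label: counter[label] / total for label in QUALITY_LABELS
        }
    return distribution


# Made-up sample rows (Q5 = human, Q13442814 = scholarly article).
sample = [
    ("Q5", "C"), ("Q5", "C"), ("Q5", "E"), ("Q5", "A"),
    ("Q13442814", "D"), ("Q13442814", "D"),
]
dist = quality_distribution(sample)
```

Per-class shares rather than raw counts make classes of very different sizes comparable, which may help when looking for clusters of classes in the sensemaking session.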
Future iterations:
Goals
- provide a set of actionable insights that could be shared with the community on which classes are critical in terms of item quality and where improvements are necessary
- derive more strategic insight into the possible future evolution of item quality in Wikidata given its current state
- illustrate why Wikipedia-like quality criteria are not feasible for Wikidata (incl. https://wikitech.wikimedia.org/wiki/User:AKhatun/Wikidata_Vertical_Analysis)
Steps
Past iterations:
Related:
- the community would like regular insights: https://www.wikidata.org/wiki/Wikidata_talk:Statistics/Wikipedia#Actuality