Main components:
- Wikidata Analytics
User story:
As a new Data Analyst for Wikidata,
I want to get a good understanding of the most important analytics products and tools
in order to have a chance to maintain them.
Problem:
The whole Wikidata Analytics codebase is overwhelming. But a new Wikidata Analyst will have to be able to work at least on our most important and most frequently used tools and dashboards. We need to ensure that the Wikidata team can run these tools sustainably even in a transition phase.
Solution:
This is why it is particularly important to bring the related code in shape and document it well. All related processes need to be documented as well.
Most important tools:
These are the most important sources and dashboards for us and our editors:
Wikidata Quality Dashboard (for editors)
- Helps the community to identify important Items that need protection.
- We only have a report so far: https://wikidata-analytics.wmcloud.org/app_direct/WD_docs/Wikidata%20Quality%20Report.nb.htm
- We are working on a dashboard now: T292862: Create an automatically updated dashboard out of the Wikidata Quality Report
100k most popular Items data set (for editors)
- Editors use this downloadable data set to protect these Items.
- https://wikidata-analytics.wmcloud.org/app_direct/WikidataAnalytics/datasets.html
Wikidata Current Events (for editors)
- Helps the community to track current events.
- https://wikidata-analytics.wmcloud.org/app/CurrentEvents
Wikidata Usage and Coverage (for reporting)
- Helps us to understand which Wikimedia Projects use what Wikidata content.
- https://wikidata-analytics.wmcloud.org/app/WD_percentUsageDashboard
Wikidata Concepts Monitor - Statements Dashboard
- We need the Property>Reuse sub page!
- This helps us to understand what properties are used often in the Wikimedia projects.
- https://wikidata-analytics.wmcloud.org/app/WDCM_StatementsDashboard
Wikidata Languages Landscape (essential analytics)
- We need the Datamodel and Label Sharing pages only!
- We use this to better understand language coverage for labels, descriptions and aliases.
- https://wikidata-analytics.wmcloud.org/app/WD_LanguagesLandscape
Wikidata Concepts Monitor - Usage Dashboard
- We need the Project Summary page!
- Helps us to understand what topics Wikidata’s data is used for in the other Wikimedia projects.
- https://wikidata-analytics.wmcloud.org/app/WDCM_UsageDashboard
Wikidata Pageviews per Namespace
- This helps us to understand the page views per Wikidata namespace.
- https://wikidata-analytics.wmcloud.org/app/WD_pageviewsPerNamespace
Wiktionary Cognate Dashboard
Acceptance criteria:
These goals:
- Ensure good documentation of code
- Ensure good documentation of processes to maintain the tool (e.g. deployment)
- Ensure high stability (the tools should ideally run without maintenance in the next years)
For these tools:
- Wikidata Quality Report
- 100k most popular Items data set
- Wikidata Current Events
- Wikidata Usage and Coverage
- Wikidata Concepts Monitor, Statements Dashboard, Property>Reuse sub page
- Wikidata Languages Landscape, Datamodel and Label Sharing pages
- Wikidata Concepts Monitor, Usage Dashboard, Project Summary page
- Wikidata Pageviews per Namespace
- Wiktionary Cognate Dashboard