Context:
Wikidata Analytics
User story:
As Wikidata PMs, we want to better understand the relationship between Items and editors over time to inform our product strategy.
Problem:
We already monitor the total number of Items and the number of active editors per day (internal). We currently do not have a Grafana board, and we do not monitor the number of active Items. The number of active Items is used as an indicator for potentially vandalized Items here.
Acceptance criteria:
- Track the number of active Items (Items that were touched at least once in the last 30 days) per day
- Create a Grafana dashboard illustrating the following:
  - the number of Items (tracked already)
  - the number of active Items
  - the number of active editors (tracked already)
  - the ratio of Items to active editors (for each of the three active-editor definitions: 1, 5, and 100 edits per editor)
  - the ratio of active Items to active editors (for each of the three active-editor definitions: 1, 5, and 100 edits per editor)
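A minimal sketch of how the metrics above could be computed, for discussion only. It assumes edits are available as `(item_id, editor, edit_date)` tuples, that the active-editor thresholds mean "at least N edits", and that editors are counted over the same 30-day window as Items; none of this is confirmed by the existing scripts. All function names are hypothetical.

```python
from collections import Counter
from datetime import date, timedelta


def _in_window(edit_date, day, window_days):
    # Window covers the `window_days` days ending on `day` (inclusive).
    return day - timedelta(days=window_days) < edit_date <= day


def count_active_items(edits, day, window_days=30):
    """Number of distinct Items touched at least once in the window."""
    return len({item for item, _editor, d in edits
                if _in_window(d, day, window_days)})


def count_active_editors(edits, day, min_edits, window_days=30):
    """Number of editors with at least `min_edits` edits in the window.

    Assumption: the same 30-day window as for Items is used.
    """
    per_editor = Counter(editor for _item, editor, d in edits
                         if _in_window(d, day, window_days))
    return sum(1 for n in per_editor.values() if n >= min_edits)


def daily_ratios(edits, total_items, day, thresholds=(1, 5, 100)):
    """Return (active_items, {threshold: ratios}) for one day.

    A ratio is None when no editor meets the threshold.
    """
    active_items = count_active_items(edits, day)
    ratios = {}
    for t in thresholds:
        editors = count_active_editors(edits, day, t)
        ratios[t] = {
            "items_per_editor": total_items / editors if editors else None,
            "active_items_per_editor": active_items / editors if editors else None,
        }
    return active_items, ratios
```

In production the input would come from a database query rather than an in-memory list, but the window and threshold logic would stay the same.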
Open questions:
- Could we easily track this for segments of Wikidata?
  - Yes, but it would be inefficient, so let's not do this for now.
- Where should we put this?
  - On the site stats Grafana dashboard for now.
Origin:
Community request
Tech note:
Take a look at this Gerrit repo: https://gerrit.wikimedia.org/r/admin/repos/analytics/wmde/scripts (on GitHub)
Docs are also available at https://wikitech.wikimedia.org/wiki/WMDE/Analytics#analytics/wmde/scripts_repo