Page MenuHomePhabricator

Compare early results of Wikistats 2.0 with Wikistats 1.0
Closed, DuplicatePublic3 Story Points

Description

Wikistats 1.0 and Wikistats 2.0 will use entirely different data streams and approach.

We want to get a notion asap of how and why results from these two processes differ.
Some differences will be accidental (bugs), maybe in the old process, maybe in the new one. Some differences may reveal definition discrepancies. Some could be the result of different methodology.

Once we explained these differences we will have learned a lot about validation issues.

We will start to compare numbers for Simple Wikipedia, beginnning with https://stats.wikimedia.org/EN/TablesWikipediaSIMPLE.htm#editor_activity_levels

So if Dan can produce a simplified version (maybe just one or two columns), both he and Erik can work on quantifying and explaining the differences.

Event Timeline

ezachte created this task.Jul 28 2016, 4:31 PM
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJul 28 2016, 4:31 PM
ezachte renamed this task from Compare Wikistats data with Hive data to Compare early results of Wikistats 2.0 with Wikistats 1.0.Jul 28 2016, 4:33 PM
Milimetric triaged this task as Normal priority.Jul 28 2016, 5:37 PM
Milimetric set the point value for this task to 3.Aug 18 2016, 3:42 PM
Milimetric edited projects, added Analytics-Kanban; removed Analytics.