Page MenuHomePhabricator

Wikistats 2 pageviews trend figure is wrong
Open, HighPublic

Description


This page [1] says "Total: 46.37B -41.80% over this time range."
This is a meaningless metric, and very obviously also false.

Problem is: what the script does is take the last and first count and calculate the ratio.

  1. Obviously the last count is for an incomplete month, it's 80-90 % less than previous months.
  2. Any trend analysis which ignores all measurements except two is painting a meaningless picture.

Lots of sophistication could be applied. A rather simple one (KISS) would be calculation of the ratio of averages for first 12 months and last 12 months (excluding instead of focusing on first and last month which could be incomplete months).

Better not compare first and last calendar year, as that could be based on 12 months, but also on 1 and anything in between. So in this case I'd recommend calculating average for all months in 2008 vs average of Aug 2015-July 2016, ignoring Dec 2017 and Aug 2016.

Also presenting the total for all months (46 B) is quite uncommon for bar charts and likely to be misinterpreted.

[1] https://stats.wikimedia.org/v2/#/it.wikipedia.org/reading/legacy-page-views/normal|bar|All|~total

Event Timeline

Restricted Application added a project: Analytics. · View Herald TranscriptDec 15 2018, 1:07 PM
Restricted Application added a subscriber: Aklapper. · View Herald Transcript
ezachte updated the task description. (Show Details)Dec 15 2018, 1:09 PM
fdans added a subscriber: fdans.Dec 17 2018, 5:15 PM

For now we'll be removing the time period trend since as you said it doesn't add value in its current form. We'll task later on how to display changes over time periods in a more effective way.

fdans triaged this task as High priority.Dec 17 2018, 5:16 PM
fdans moved this task from Incoming to Wikistats Production on the Analytics board.