
Inconsistent Avg. Daily Pageview results between Pages Improved and Event Summary
Closed, Resolved | Public | BUG REPORT

Description

I found an inconsistency between the Avg. Daily Pageviews numbers in the Pages Improved report and those in the Event Summary report. To reproduce:

  • Download the Pages Improved CSV for an event. I used the testing event "At Least One of All Contribution Types" (in the Program "joe's testing events").
  • Add up the Avg. Daily Pageviews numbers for en.wiki pages only.
  • Compare to the Event Summary onscreen report, which gives a per-wiki breakdown and also shows Avg. Views for en.wiki.
  • Expected results: the numbers should be the same.
  • Actual results: As of 2019-03-28 18:13, Pages Improved shows 13,515 but the Summary number is 12,937—a pretty big difference.
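For anyone repeating the comparison, the summing step can be scripted instead of done by hand. This is a minimal sketch only: the column names "Wiki" and "Avg. daily pageviews" and the wiki value "en.wikipedia" are assumptions for illustration, so check them against the header row of the actual Pages Improved download.

```python
import csv
import io


def sum_avg_daily_pageviews(csv_text, wiki,
                            wiki_col="Wiki",
                            views_col="Avg. daily pageviews"):
    """Sum the average-daily-pageviews column for rows belonging to one wiki.

    wiki_col and views_col are hypothetical column names; adjust them to
    match the header of the real export.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    total = 0
    for row in reader:
        if row[wiki_col] != wiki:
            continue
        # Tolerate thousands separators such as "1,200".
        total += int(row[views_col].replace(",", ""))
    return total
```

The result is the number to hold against the en.wiki Avg. Views figure in the on-screen Event Summary.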

@dom_walden confirms he saw this discrepancy elsewhere as well. It looks like a math problem, maybe?

Event Timeline

I put this directly into the Ready column. It would be nice if we could clear it up today, since we are clearly giving wrong numbers.

At first glance, both reports appear to use largely the same functions, which is interesting. I'm taking a closer look.

As best I can tell, both of those reports end up resolving down to PageviewsRepository::getPageviewsPerArticle() for the source of the data.

I'll ask a naive question: The Pages Improved number is the average pageviews for improved pages only. The Event Summary number is the average pageviews for all pages in the Event. That might include pages that are created but not improved. Isn't it logical that these numbers could be different?

Finally figured it out: it was getting pageviews since the start of the event rather than over a 31-day window. PR: https://github.com/wikimedia/eventmetrics/pull/261
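To see why the choice of window alone changes the average, here is a small illustrative sketch. It is not the eventmetrics code (that is in the PR above); the function names, the inclusive 31-day window ending on a caller-supplied date, and the rule that missing days count as zero are all assumptions made for the example.

```python
from datetime import date, timedelta


def avg_daily_views(views_by_day, start, end):
    """Mean daily views over the inclusive range [start, end].

    views_by_day maps date -> view count; days missing from the mapping
    are counted as 0 (an assumption for this sketch).
    """
    days = (end - start).days + 1
    total = sum(views_by_day.get(start + timedelta(days=i), 0)
                for i in range(days))
    return total / days


def window_start(end, length_days=31):
    """First day of an inclusive window of length_days ending on `end`."""
    return end - timedelta(days=length_days - 1)
```

Calling `avg_daily_views(views, event_start, end)` and `avg_daily_views(views, window_start(end), end)` on the same page data gives different results whenever traffic before the window differs from traffic inside it, which is the kind of mismatch reported here.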

jmatazzoni moved this task from QA to Q3 2018-19 on the Community-Tech-Sprint board.

The numbers match! Resolving this.