
Implement IE7 correction for long-term trend charts
Open, Needs Triage, Public

Description

  • Augment this and this chart with the correction method already used to remove this spurious traffic from the recently reported monthly pageview numbers
  • Publicly document the correction calculation

Event Timeline

Restricted Application added a subscriber: Aklapper. Sep 15 2017, 10:48 PM
Tbayer updated the task description. Nov 2 2017, 9:58 PM
Tbayer moved this task from Triage to Doing on the Product-Analytics board. Apr 26 2018, 8:23 PM

For the record, below is an example of the queries I have been using for this. It is based on the detailed analysis for Pakistan in https://phabricator.wikimedia.org/T157404 (that task is set to private because the examination involved looking at some IP information), and it also covers two other countries, Iran and Afghanistan, that showed a similarly anomalous pattern of IE7 views far exceeding those from newer IE versions.

Note, though, that the recent ua-parser upgrade raised new questions about this; see T193578#4238244.

-- Daily pageviews by access method, excluding the suspected spurious IE7 traffic.
SELECT year, month, day,
  CONCAT(year, '-', LPAD(month, 2, '0'), '-', LPAD(day, 2, '0')) AS date,
  SUM(IF(access_method = 'mobile app', view_count, NULL)) AS Apps,
  SUM(IF(access_method = 'desktop', view_count, NULL)) AS Desktop,
  SUM(IF(access_method = 'mobile web', view_count, NULL)) AS MobileWeb
FROM wmf.pageview_hourly
WHERE year > 0  -- partition predicate that selects every year
  AND agent_type = 'user'
  AND NOT (
    country_code IN ('PK', 'IR', 'AF')  -- https://phabricator.wikimedia.org/T157404#3194046
    AND user_agent_map['browser_family'] = 'IE'
    AND user_agent_map['browser_major'] = '7'  -- compared as a string, assuming string map values
  )
GROUP BY year, month, day
ORDER BY year, month, day
LIMIT 1000;
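
To make the exclusion rule in the query above easier to check by hand, here is a minimal Python sketch of the same filter applied to in-memory rows. The row layout, the helper names (`is_spurious_ie7`, `corrected_daily_views`) and the sample data are assumptions made up for illustration; they are not part of the original analysis, and the sketch ignores SQL NULL semantics.

```python
from collections import defaultdict

# Countries listed in the query's exclusion rule (see T157404).
FLAGGED_COUNTRIES = {"PK", "IR", "AF"}


def is_spurious_ie7(row):
    """Return True if the row matches the query's exclusion predicate."""
    ua = row["user_agent_map"]
    return (
        row["country_code"] in FLAGGED_COUNTRIES
        and ua.get("browser_family") == "IE"
        and str(ua.get("browser_major")) == "7"
    )


def corrected_daily_views(rows):
    """Sum view_count per (date, access_method), keeping only user
    traffic and skipping rows flagged as spurious IE7 views."""
    totals = defaultdict(int)
    for row in rows:
        if row["agent_type"] != "user" or is_spurious_ie7(row):
            continue
        date = f"{row['year']}-{row['month']:02d}-{row['day']:02d}"
        totals[(date, row["access_method"])] += row["view_count"]
    return dict(totals)


def make_row(access_method, agent_type, country, family, major, views):
    """Build one hypothetical row for a fixed example date."""
    return {
        "year": 2018, "month": 4, "day": 1,
        "access_method": access_method,
        "agent_type": agent_type,
        "country_code": country,
        "user_agent_map": {"browser_family": family, "browser_major": major},
        "view_count": views,
    }


# Hypothetical sample rows; the numbers are invented, not real traffic data.
sample_rows = [
    make_row("desktop", "user", "PK", "IE", "7", 100),       # excluded: flagged country + IE7
    make_row("desktop", "user", "PK", "IE", "11", 40),       # kept: newer IE version
    make_row("desktop", "user", "DE", "IE", "7", 5),         # kept: country not flagged
    make_row("mobile web", "user", "IR", "Chrome", "65", 30),  # kept: not IE
    make_row("desktop", "spider", "US", "Other", "-", 1000),  # excluded: not user traffic
]

if __name__ == "__main__":
    for key, views in sorted(corrected_daily_views(sample_rows).items()):
        print(key, views)
```

Only rows that match all three conditions (flagged country, IE family, major version 7) are dropped, so IE7 views from other countries and newer IE versions from the flagged countries still count toward the totals.
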

JKatzWMF moved this task from Doing to Stalled on the Product-Analytics board. Oct 4 2018, 8:32 PM