Vet and explore new readership retention metric
Open, Normal, Public


(Task intended to form part of a possible Outreachy internship in data analysis with the WMF reading team)

Vet and explore a new privacy-friendly web readership retention metric (based on a data source selected by the Reading team), and build a reporting mechanism for it.

  1. (~ 2 weeks) A report examining various possible specifications of this metric (e.g. choice of percentile), their possible data quality issues with suggestions for how to fix or mitigate them, and an assessment of their sensitivity and robustness
  2. (~ 1 week) An exploratory analysis showing how the chosen metric differs across various dimensions, e.g. project language or geographical region
  3. (~ 1 week) A workflow or an automated tool to regularly inform the Reading team and the Wikimedia movement on how this metric is developing
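
To illustrate the kind of comparison item 1 asks for, here is a minimal sketch of how a mean-based and a percentile-based specification could be checked for sensitivity to a single extreme value. The task does not define the metric's underlying data, so the input (a list of "days until a reader returns"), the `summarize` helper, and the sample values are all hypothetical and only serve to show the shape of the check.

```python
from statistics import mean, quantiles


def summarize(days, percentiles=(50, 75, 90)):
    """Return the mean and selected percentiles of a list of return intervals.

    `days` is a hypothetical sample of return intervals in days; the real
    data source is chosen by the Reading team and is not specified here.
    """
    # quantiles(..., n=100) returns 99 cut points; cuts[p - 1] is the p-th percentile.
    cuts = quantiles(days, n=100, method="inclusive")
    summary = {"mean": mean(days)}
    for p in percentiles:
        summary[f"p{p}"] = cuts[p - 1]
    return summary


# Made-up sample values, for illustration only.
sample = [1, 1, 2, 2, 3, 3, 4, 5, 7, 10]
with_outlier = sample + [365]  # one reader returning after a year

base = summarize(sample)
shifted = summarize(with_outlier)

# Print how far each specification moves when the outlier is added.
for key in base:
    print(f"{key}: {base[key]:.2f} -> {shifted[key]:.2f}")
```

In this toy sample the mean moves far more than the median when the outlier is added, which is the sort of robustness difference the report would need to quantify on real data before a specification is chosen.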

Event Timeline

Restricted Application added a subscriber: Aklapper. Oct 15 2016, 2:13 AM
Tbayer assigned this task to Zareenf. Dec 8 2016, 2:52 AM

This project is currently in progress. I've vetted different specifications of this metric (averages and various percentiles) and examined it across different dimensions (device type, Wikipedia project language). I still need to finalize the queries that calculate the metric, create more charts, and set up an Oozie job to automate updating the metric.

Tbayer moved this task from Triage to Backlog on the Product-Analytics board. Jun 21 2018, 8:24 PM
MBinder_WMF triaged this task as Normal priority. Aug 2 2018, 8:17 PM
JKatzWMF moved this task from Backlog to Triage on the Product-Analytics board. Aug 2 2018, 8:19 PM
JKatzWMF reassigned this task from Zareenf to Tbayer. Aug 3 2018, 3:59 PM
JKatzWMF added a subscriber: Zareenf.

The next step here is for @Tbayer to break up the remaining items on this task as necessary so that someone can work on them.

This comment was removed by JKatzWMF.
JKatzWMF moved this task from Triage to Backlog on the Product-Analytics board. Aug 3 2018, 4:01 PM
Tbayer moved this task from Backlog to Doing on the Product-Analytics board. Sep 20 2018, 10:01 PM
MBinder_WMF moved this task from Doing to Icebox on the Product-Analytics board. Apr 18 2019, 7:26 PM
Aklapper removed Tbayer as the assignee of this task. Apr 22 2019, 11:48 AM

Resetting task assignee as the user is not active here anymore.