(Task intended to form part of a possible Outreachy internship in data analysis with the WMF reading team)
Vet and explore a new privacy-friendly web readership retention metric (based on a data source selected by the Reading team), and build a reporting mechanism for it.
- (~ 2 weeks) A report examining various possible specifications of this metric (e.g. choice of percentile, etc.), their possible data quality issues and suggestions how to fix or mitigate them, and an assessment of their sensitivity and robustness
- (~ 1 week) An exploratory analysis showing how the chosen metric differs across various dimensions, e.g. project language or geographical region
- (~ 1 week) A workflow or an automated tool to regularly inform the Reading team and the Wikimedia movement on how this metric is developing