I am a current candidate for the Outreachy Internship program with Wikimedia Foundation. I aim to work with the Reading Team.
Jul 18 2018
Feb 24 2018
Mar 27 2017
Mar 21 2017
Mar 3 2017
This project is currently in progress. I've completed initial data quality checks on the table and developed the queries for average and percentiles.
This project is currently in progress. I've vetted different specifications of this metric (averages and various percentiles) and examined across different dimensions (device type, Wikipedia project languages). I still need to finalize the queries to calculate the metric, create more charts, and set up an oozie job to automate updating the metric.
Hi all. Below are initial results from the data quality checks (the queries and outputs are documented in the scratchpad @Tbayer linked). No significant anomalies have been found, so far, besides the ~16% of pageTokens appearing in only 1 row, which was kind of expected.
Feb 1 2017
Jan 6 2017
Dec 14 2016
Oct 31 2016
@Dzahn here is a new SSH key for production access:
Oct 27 2016
Oct 26 2016
@Krenair it doesn't seem like statistics-users adds any additional access than researchers so I will remove that from my request https://wikitech.wikimedia.org/wiki/Analytics/Data_access#Production_access
Oct 20 2016
Oct 18 2016
Oct 17 2016
Oct 16 2016
Oct 15 2016
Results for 100 most frequent section titles (filtered for only articles), their frequency, and percentage:
Results for 100 most frequent section titles and frequency: