Some ideas including those that came up during our knowledge transfer of high level metrics calculation with Connie. We can break these out into separate tasks if required.
- Instrument
[x] Test ETL editors notebooks
[] finalize tests and instrument all
- Tables
[] Enable movement metric calculation as analytics-product user: T295332, T288677, T291957, T291956
[] Also, need to look at Connie’s tables: cchen.repo_active_editors & cchen.new_editors --> when these are moved, also need to update Superset and [[ https://github.com/wikimedia-research/wiki-comparison/blob/master/data-collection/data-collection.ipynb | wiki comparison ]] T294653
- refactoring & adding diversity
[] Creating - add diversity for new and returning active editors
[] Need help : refactor combine all inserts : refactoring
- Calculations
[] Add functions in editing-movement notebook (03.report) to calculate net new content (non-wikidata)
[] Consolidate and Make [[ https://docs.google.com/spreadsheets/d/1mK-R8qWzKjSeHMBBek9sJsbecdic9s3r28OIW7QkqrE/edit#gid=476321462 | readers ]]-[[ https://docs.google.com/spreadsheets/d/1wfTtHjQP9Kj0WME15ESJ-4dSMGMpbtY8qOuDVcwZovQ/edit#gid=1862467345 | editors ]] google sheet more reader friendly. Remove mobile-heavy metrics tab, add new columns like YoY, FY average, quarterly average, FY YTD average etc. ([[ https://docs.google.com/spreadsheets/d/1D7aJxhhA4apxRUVjKf_Vs6M8fleIlH-kVY1hefRP9Nw/edit#gid=781717421 | reference document ]])
No longer required as we will not be using MMTP sheets
~~[] Update the rpt repos to calculate data for the Movement metrics tables preparation sheets file~~
~~[] Work on consolidating and making the MMPT, Movement Metrics Preparation Table sheets more reader friendly (tables stay in MMTP only...keep YoY)~~
- Diversity sheet
[] net new non-wikidata content = net new content MINUS new wikidata content
[] YOY - editors sheet (jupyter notebook has all data) - sum rows for the previous year (net new content MINUS new wikidata content )
- Platform Evolution sheet
[] % from baseline = status column
[] % of wiki data items ---> see wikidata items being reused (status)
- Platform Evolution R Viz
[] Starting y axis at 0
[] Ticks: Major breaks: 50 mil
[] Ticks: Minor breaks: 10mil
[] Remove x lines
[] Geom point only the latest point and those others to especially highlight
- Colors
[] Switch to gray for previous year and only use blue for current year
- Discuss
~~[] Use google sheet macro to update metrics in the correct format (instead of manually updates)~~
We are no longer going to update [[ https://docs.google.com/presentation/d/1D_MuQ4Cf23Agn1o_ausJtH5rrJysqtGIYzmK8xxEX7M/edit#slide=id.g4463d16142_0_0 | board deck ]] with detailed analysis. Only summary slides will be added every month
~~[] Add notes and analysis to the [board staging deck](https://docs.google.com/presentation/d/1D_MuQ4Cf23Agn1o_ausJtH5rrJysqtGIYzmK8xxEX7M/edit#slide=id.g4463d16142_0_0) ~~
[] Tuning Session: Create a notebook to automate the calculations and output calculated numbers to copy/paste into the presentations