This script: https://github.com/wikimedia/analytics-limn-edit-data/blob/master/edit/save_rates_low_noise.sql (terrible name) needs to run every day, in a reliable way. It should be rerun if it missed a run and it should be re-runnable for one or more of the days it already ran, just in case we change the logic or had bad data (backfilling etc.)
Write side script that gets called by generate.py and schedules VE metrics.
It should focus on reliability, consistent logging, monitoring.
- rerunning days specified (specified how?)
- rerunning failed / missed days (by hand or by identifying what days have not run)
- we should be able to easily turn it off/on
Make sure generate.py doesn't fork to the new VE metric schedule for other teams (language, mobile, etc)
- edit the config.yaml from the limn-edit-data/edit