Background/Goal
Product Analytics maintains a Superset dashboard of IP Masking (Temp Accounts) metrics for Trust & Safety Product. Most of these metrics are calculated from MediaWiki tables that are sqooped into the data lake (in wmf_raw). One table, cu_log, is missing from the data lake, so we have to calculate the Admin Requests metric by querying the MariaDB analytics replicas instead.
Currently, all of these metrics are calculated in a Jupyter notebook scheduled to run under @jwang's user account. We are starting to migrate the queries to Airflow (T364406) and will be able to migrate all but one: the Admin Requests metric, which depends on cu_log.
Please add cu_log to the list of MediaWiki tables that are sqooped and made available as monthly snapshots in the data lake. Thank you!
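For illustration only, here is a rough sketch of the kind of per-month aggregation we would like to run against a cu_log snapshot once it exists. It uses an in-memory SQLite table as a stand-in for the data lake so that it runs anywhere. The `snapshot` and `wiki_db` partition columns, the sample rows, and the function names are assumptions made for this example, and the query simply counts all log rows per month, since the exact definition of the Admin Requests metric is not spelled out in this task.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory stand-in for a sqooped cu_log snapshot (made-up rows)."""
    conn = sqlite3.connect(":memory:")
    # Assumption: the snapshot keeps cul_timestamp in MediaWiki's
    # YYYYMMDDHHMMSS format and is partitioned by snapshot and wiki_db.
    conn.execute(
        "CREATE TABLE cu_log (cul_timestamp TEXT, snapshot TEXT, wiki_db TEXT)"
    )
    rows = [
        ("20240403120000", "2024-05", "enwiki"),
        ("20240420080000", "2024-05", "enwiki"),
        ("20240502090000", "2024-05", "enwiki"),
        ("20240502090000", "2024-05", "dewiki"),
        ("20240403120000", "2024-04", "enwiki"),
    ]
    conn.executemany("INSERT INTO cu_log VALUES (?, ?, ?)", rows)
    return conn


def monthly_cu_log_counts(conn, snapshot, wiki_db):
    """Count cu_log rows per calendar month for one snapshot and one wiki."""
    query = """
        SELECT substr(cul_timestamp, 1, 6) AS month, COUNT(*) AS log_entries
        FROM cu_log
        WHERE snapshot = ? AND wiki_db = ?
        GROUP BY substr(cul_timestamp, 1, 6)
        ORDER BY month
    """
    return conn.execute(query, (snapshot, wiki_db)).fetchall()


if __name__ == "__main__":
    demo = build_demo_db()
    for month, log_entries in monthly_cu_log_counts(demo, "2024-05", "enwiki"):
        print(month, log_entries)
```

In the migrated Airflow job, the same shape of query would presumably run against the data lake table rather than SQLite; the real table name and any filters needed for the metric would have to be confirmed once the snapshot is available.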
KR/Hypothesis (Initiative)
Temporary accounts for unregistered users
Success metrics
- How we will measure success
In scope
- Monthly snapshots of cu_log available in the data lake
Out of Scope
- known boundaries