- Dependencies
- new column: mediawiki_history.event_user_is_permanent as per T356701#10237761
- HQL logic that needs to change
- Since this logic just copies is_anonymous from its source table, MediaWiki History, our task here is probably to copy the two new fields: is_temporary and is_permanent as well. This means everything downstream will change.
- HQL table creation scripts that need to change
- Just the edit_hourly table here, rest in downstream section below
- Deployment plan script
- <<plan steps>>
- Airflow DAG that schedules the HQL logic
Testing notes
List of affected downstream pipelines that we discover
Vetting notes
- run old data through new code and comparing yields identical results to old job results
- run new data through new code yields expected results. A list of wikis where temp accounts is deployed should be leveraged here.
- check logged-in users
- check temp accounts
- check anonymous accounts
Once vetting is complete, deploy according to deployment script in related task.