As a machine learning engineer,
I want to update the add-a-link models by using the learnings from T393474 and options considered in this document.
We focus on following goals:
- Scale Add-a-Link model across more languages FY2024 WE1.2
- Retrain Add-a-Link models FY2024 WE1.2
Following steps will be implemented to achieve the goals:
- Move pipeline from research datasets to ml pipelines.
- Add training Airflow DAG in ml airflow
- Add qid filter per wiki logic. Add enwiki, country names. Report new results on enwiki
- Training on all languages on the new pipeline.
- Add staging release pipeline steps:
- check if wikis are above the threshold.
- to export hdfs to pkl
- generate_sqlite_data
- create_tables
- copy-sqlite-to-mysql
- Staging release all languages above threshold.
- Airflow dag for staging release. (has a pre-defined list of wikis to release)
- Add prod release dag:
- export from staging db.
- copy exports to a shared place.
- Airflow dag for prod release (has a pre-defined list of wikis to release).
- Update inference service to use new models.
- Consistent package versions.
- Add new input wiki_db.
- Test if it works with both old and new models.
- Fix https://phabricator.wikimedia.org/T275358
- Release jawiki to staging for testing end-to-end. (It's ok to release jawiki as it's not used by growth team yet.)
- Release jawiki to prod for testing end-to-end. (It's ok to release jawiki as it's not used by growth team yet.)
- Manually calculate current accuracy performance of major wikis on prod before release.
- Release models. (already automated every 30 minutes.)
- TBD: how to release iteratively
- Manually calculate current accuracy performance of major wikis on prod after release (in X months).
- Removed from scope:
- (Optional) Experiment on less models.
- Cluster based on derived features on the training sets.
Reporting format
Progress update on the hypothesis for the week, including if something has shipped:
Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):
Any emerging blockers or risks:
Any unresolved dependencies:
New lessons from the hypothesis:
Changes to the hypothesis scope or timeline:



