Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | fkaelin | T341817 Standardize research pipelines - Dataset generation | |||
Resolved | fkaelin | T348819 Develop pipelines for research datasets - Q2 | |||
Resolved | Pablo | T345345 Risk Observatory Data for Disinformation Response Team around Elections | |||
Resolved | None | T343065 Scheduled risk observatory pipeline | |||
Resolved | • nickifeajika | T349614 Archeology on the notebooks / documentation | |||
Resolved | • nickifeajika | T349615 Implement risk obsevatory pipeline | |||
Resolved | MunizaA | T348367 Create a python package to compute wikitext embeddings in the WMF data infra |
Event Timeline
Comment Actions
Updates
- Created a new repo research-datasets to consolidate production research pipelines that produce datasets
- Nearly completed analysis of risk-observatory notebooks/documentation (T349614)
Comment Actions
This work is completed, the pipelines that were added:
- Risk observatory pipeline (airflow dag)
- Wikidiff pipeline (airflow dag)
- Article embeddings pipeline