Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | Halfak | T243451 Deploy ORES -- Late Jan 2020 | |||
Resolved | Halfak | T235181 Build WikiProject directory topic models for ar, cs, and kowiki | |||
Resolved | Halfak | T235183 Experiment with different vector lengths for ar, cs, en, and kowiki topic models. | |||
Resolved | Halfak | T235187 Create labeled data for topic models in ar, cs, kowiki | |||
Resolved | Isaac | T236713 Improve drafttopic training data pipeline | |||
Resolved | Isaac | T240273 Extract cross-wiki WikiProject tags | |||
Resolved | Halfak | T240286 Re-train English Wikipedia topic model using new WikiProject Taxonomy | |||
Resolved | Halfak | T240276 Restructure WikiProject directory to be better | |||
Resolved | kevinbazira | T240282 Improve WikiProject template --> WikiProject mapping |
Event Timeline
Comment Actions
In T236713: Improve drafttopic training data pipeline, @Isaac has extracted a full dataset of cross-wiki labels as mapped by Wikidata. We can use this to create stratified samples of labeled datasets for use in training models.