The scope should be kept minimal here. We need to rewrite the pipeline pieces which fetch all pages with embedded wikiproject templates, and the code to generate a list of revisions to train on. Previous code was difficult to run repeatably due to a dependency on PAWS, and it seems we were training on a random talk page revision, when we should have been looking for the first revision of the content page linked to the talk page.
Description
Description
Event Timeline
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptMay 2 2018, 12:20 AM2018-05-02 00:20:57 (UTC+0)
awight edited projects, added Machine-Learning-Team; removed Machine-Learning-Team (Active Tasks).Jun 11 2018, 10:41 PM2018-06-11 22:41:33 (UTC+0)
• Vvjjkkii renamed this task from Rewrite draft topic scripts to fetch linked pages and prepare training data to ptdaaaaaaa.Jul 1 2018, 1:12 AM2018-07-01 01:12:31 (UTC+0)
CommunityTechBot renamed this task from ptdaaaaaaa to Rewrite draft topic scripts to fetch linked pages and prepare training data.Jul 1 2018, 9:32 AM2018-07-01 09:32:02 (UTC+0)
CommunityTechBot raised the priority of this task from High to Needs Triage.Jul 3 2018, 1:57 AM2018-07-03 01:57:16 (UTC+0)
• ACraze moved this task from Maintenance/cleanup to Backlog/ORES on the Machine-Learning-Team board.Jan 19 2021, 10:32 PM2021-01-19 22:32:24 (UTC+0)