
[Spike] Explore issues with enwiki articlequality model
Open, Low, Public

Description

See https://en.wikipedia.org/w/index.php?title=User_talk:Jimbo_Wales&oldid=721704824#Underground_lair_minions_wondering_why_articles_on_timezones_are_so_popular

It looks like the model is bad at predicting the quality of lists. It also looks like some articles pull in a large share of their content and references via transclusion. This seems to be common for articles about TV series that use tables extensively. We may be able to ask the Wikipedia API for a parsed version of the page, so that content and references brought in by transclusions are visible to the model.
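A minimal sketch of that idea, assuming the MediaWiki Action API's `action=parse` module is used to get rendered HTML with transclusions expanded. The helper names (`build_parse_params`, `fetch_parsed_html`, `count_rendered_references`) and the User-Agent string are hypothetical and are not part of the articlequality code. Counting `<sup class="reference">` markers is only one rough signal, and it assumes the default parser's footnote markup.

```python
import json
import re
import urllib.parse
import urllib.request

API_URL = "https://en.wikipedia.org/w/api.php"


def build_parse_params(title=None, oldid=None):
    """Build query parameters for action=parse (rendered HTML, templates expanded)."""
    if (title is None) == (oldid is None):
        raise ValueError("pass exactly one of title or oldid")
    params = {
        "action": "parse",
        "prop": "text",
        "format": "json",
        "formatversion": "2",  # with version 2, parse.text is a plain string
    }
    if oldid is not None:
        params["oldid"] = str(oldid)
    else:
        params["page"] = title
    return params


def fetch_parsed_html(title=None, oldid=None):
    """Fetch the parsed HTML of a page or revision (performs a network request)."""
    query = urllib.parse.urlencode(build_parse_params(title, oldid))
    request = urllib.request.Request(
        API_URL + "?" + query,
        # Hypothetical User-Agent; replace with one that identifies your tool.
        headers={"User-Agent": "articlequality-spike-example/0.1"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        data = json.load(response)
    return data["parse"]["text"]


def count_rendered_references(html):
    """Count footnote markers (<sup ... class="reference">) in parsed HTML."""
    pattern = r'<sup[^>]*\bclass="[^"]*\breference\b[^"]*"'
    return len(re.findall(pattern, html))
```

Comparing `count_rendered_references(fetch_parsed_html(title=...))` with the number of `<ref>` tags in the raw wikitext of the same page would give a rough measure of how many references arrive only through transclusion.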

This task is done when an exploration of these issues is completed and documented in the comments of this task and new tasks are filed for next steps.

Event Timeline

Halfak created this task. May 23 2016, 3:42 PM
Restricted Application added subscribers: Zppix, Aklapper. May 23 2016, 3:42 PM

@Ladsgroup suggested that we look into using Parsoid output.

Halfak triaged this task as Low priority. Jul 5 2016, 2:30 PM
awight renamed this task from [Spike] Explore issues with enwiki wp10 model to [Spike] Explore issues with enwiki articlequality model. Sep 26 2018, 6:42 PM
Restricted Application added a project: artificial-intelligence. Sep 26 2018, 6:42 PM