[Spike] Explore issues with enwiki articlequality model
Open, LowPublicSpike
Actions

Assigned To

None

Authored By

	Halfak
	May 23 2016, 3:42 PM

Description

See https://en.wikipedia.org/w/index.php?title=User_talk:Jimbo_Wales&oldid=721704824#Underground_lair_minions_wondering_why_articles_on_timezones_are_so_popular

It looks like the model is bad at predicting the quality of lists. It also looks like some articles have a bunch of content and references that come in via transclusion. This seems to be common for articles about TV series that use tables extensively. We may be able to ask the Wikipedia API for a parsed version of the page so that we can circumvent issues with tranclusions that bring substantial content and references.

This task is done when an exploration of these issues is completed and documented in the comments of this task and new tasks are filed for next steps.

Event Timeline

Halfak created this task.May 23 2016, 3:42 PM

Restricted Application added subscribers: Zppix, Aklapper. · View Herald TranscriptMay 23 2016, 3:42 PM

@Ladsgroup suggested that we look into using parsoid output.

Halfak moved this task from Unsorted to New development on the Machine-Learning-Team board.Jun 6 2016, 5:00 PM

Halfak triaged this task as Low priority.Jul 5 2016, 2:30 PM

Danny_B added a project: Spike.Jul 5 2016, 6:35 PM

Halfak moved this task from New development to Research & analysis on the Machine-Learning-Team board.Sep 22 2016, 2:50 PM

awight renamed this task from [Spike] Explore issues with enwiki wp10 model to [Spike] Explore issues with enwiki articlequality model.Sep 26 2018, 6:42 PM

awight added a project: articlequality-modeling.

Restricted Application added a project: artificial-intelligence. · View Herald TranscriptSep 26 2018, 6:42 PM

Halfak edited projects, added Machine-Learning-Team (Research); removed Machine-Learning-Team.Apr 2 2019, 9:33 PM

Restricted Application edited projects, added Machine-Learning-Team; removed Machine-Learning-Team (Research). · View Herald TranscriptApr 2 2019, 9:33 PM

Harej edited projects, added Machine-Learning-Team (Research); removed Machine-Learning-Team.Apr 3 2019, 4:33 AM

Ladsgroup unsubscribed.Apr 17 2019, 7:25 PM

calbon removed a project: Machine-Learning-Team (Research).Sep 23 2020, 4:40 PM

Restricted Application changed the subtype of this task from "Task" to "Spike". · View Herald TranscriptSep 23 2020, 4:40 PM

[Spike] Explore issues with enwiki articlequality modelOpen, LowPublicSpikeActions

Description

Event Timeline

[Spike] Explore issues with enwiki articlequality model
Open, LowPublicSpike
Actions