Investigate training ORES with AbuseFilter conditions
Open, HighPublic
Actions

Assigned To

None

Authored By

	dbarratt
	Feb 1 2018, 7:44 PM

Description

AbuseFilter contains 192 filters on English Wikipedia.

These filters can tell us:

What edits constitute as abuse.
What was used to determine that they are abuse.
The severity of the abuse.

We should investigate using this data to better train ORES to detect vandalism.

With this data, perhaps it would be possible for ORES to be better at identifying what is vandalism and with any hope exceed the reliability of AbuseFilter at identifying and preventing abuse.

Related Objects

Mentioned In: T185154: AbuseFilter (and dependencies): code stewardship review
Mentioned Here: T30213: AbuseFilter should let users to mark log entries as false positives

Event Timeline

dbarratt created this task.Feb 1 2018, 7:44 PM

Restricted Application added a project: Machine-Learning-Team. · View Herald TranscriptFeb 1 2018, 7:44 PM

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

• TBolliger updated the task description. (Show Details)Feb 1 2018, 7:48 PM

• TBolliger subscribed.

dbarratt mentioned this in T185154: AbuseFilter (and dependencies): code stewardship review.Feb 1 2018, 8:05 PM

He7d3r subscribed.Feb 2 2018, 9:59 AM

I agree, the idea is good, though we would need some ways to set a "quality" for each filter, e.g. T30213.

awight moved this task from Unsorted to Research & analysis on the Machine-Learning-Team board.Jun 20 2018, 3:02 PM

Halfak edited projects, added Machine-Learning-Team (Research); removed Machine-Learning-Team.Apr 2 2019, 9:33 PM

Harej triaged this task as High priority.Apr 9 2019, 9:21 PM

calbon removed a project: Machine-Learning-Team (Research).Sep 23 2020, 4:32 PM

Ahmad252 subscribed.Oct 24 2020, 7:34 PM

Investigate training ORES with AbuseFilter conditionsOpen, HighPublicActions

Description

Related Objects

Event Timeline

Investigate training ORES with AbuseFilter conditions
Open, HighPublic
Actions