[Spike] Investigate building a hook for abuse filter
Open, LowestPublic
Actions

Assigned To

None

Authored By

	Ladsgroup
	Jan 9 2016, 9:56 PM

Description

As an abusefilter maintainer, I would like to be use the likelihood of a revision being damaging and/or goodfaith to decide whether a filter should display a warning, disallow the edit, or do nothing.

This would allow filters to have less false positives than manually constructed rules which take into account only a small subset of features which revscoring models can use.

That would be cool!

Related Objects

Mentioned In: T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context
T299436: How impactful would pre-save automoderation be on edit save times?
T123871: Integrate with mw-core ChangeTags

Event Timeline

Ladsgroup created this task.Jan 9 2016, 9:56 PM

Ladsgroup claimed this task.

Ladsgroup raised the priority of this task from to Needs Triage.

Ladsgroup updated the task description. (Show Details)

Ladsgroup added projects: MediaWiki-extensions-ORES, Machine-Learning-Team (Active Tasks).

Ladsgroup added subscribers: Ladsgroup, Ebrahim.

Restricted Application added subscribers: StudiesWorld, Aklapper. · View Herald TranscriptJan 9 2016, 9:56 PM

Ladsgroup moved this task from Parked to Backlog on the Machine-Learning-Team (Active Tasks) board.Jan 9 2016, 9:58 PM

Ladsgroup moved this task from Backlog to Later on the MediaWiki-extensions-ORES board.Jan 15 2016, 9:39 PM

Ladsgroup mentioned this in T123871: Integrate with mw-core ChangeTags.Jan 21 2016, 2:38 AM

This is technically possible, yes. But I'm skeptical, because AIUI ORES is currently set up to process edits *after* they're saved, how would it work pre-save? Use PST wikitext? HTML? What about content that is vary-revision?

Also we're working on cutting down save time a lot, and adding an extra network request may work counter to that goal...

These are pretty valid points, thank you for this

Ladsgroup triaged this task as Lowest priority.Jan 21 2016, 10:02 AM

Ladsgroup set Security to None.

Lydia_Pintscher added subscribers: Lydia_Pintscher, hoo.Jan 22 2016, 1:24 PM

Ladsgroup moved this task from Backlog to Completed on the Machine-Learning-Team (Active Tasks) board.Jan 24 2016, 5:23 PM

Ladsgroup moved this task from Later to Done on the MediaWiki-extensions-ORES board.Jan 27 2016, 2:05 AM

Ladsgroup closed this task as Declined.Feb 2 2016, 12:29 PM

• Phabricator_maintenance added a project: User-Ladsgroup.Aug 12 2016, 8:09 PM

He7d3r awarded a token.Oct 23 2016, 11:32 AM

I would like this to be reconsidered, as I don't see the problem: there are many features¹ in revscoring which seems to be available to AbuseFilter already as variables, or which could be obtained from wherever AF gets them right now.

At most we would need to train specific models not requiring any feature for which the date is only available after saving the page. Anyway, this would provide AbuseFilter a model which is a lot better than our current strategies of checking *only* the namespace and the number of bytes in each revision:
https://pt.wikipedia.org/wiki/Special:AbuseFilter/3
https://pt.wikipedia.org/wiki/Special:AbuseFilter/119

¹ "feature" in this sense: https://github.com/wiki-ai/revscoring/blob/master/revscoring/features/wikitext/features/chars.py

He7d3r added a project: AbuseFilter.Oct 23 2016, 11:54 AM

He7d3r updated the task description. (Show Details)

@He7d3r, would you like to take on this task?

Halfak edited projects, added Machine-Learning-Team; removed Machine-Learning-Team (Active Tasks).Oct 27 2016, 2:38 PM

Halfak moved this task from Unsorted to Ideas on the Machine-Learning-Team board.Oct 27 2016, 2:43 PM

Ladsgroup removed a project: User-Ladsgroup.Oct 30 2016, 2:07 AM

Unfortunately I don't have the time to explore this currently... :-(

Ricordisamoa subscribed.Feb 25 2017, 1:18 AM

matej_suchanek moved this task from Done to Backlog on the MediaWiki-extensions-ORES board.Mar 3 2017, 7:08 PM

TheresNoTime subscribed.Dec 7 2017, 2:36 PM

Harej moved this task from Ideas to Epic on the Machine-Learning-Team board.Apr 3 2019, 4:53 AM

Ladsgroup unsubscribed.Apr 17 2019, 7:13 PM

• ACraze moved this task from Epic to Backlog/ORES on the Machine-Learning-Team board.Jan 19 2021, 8:45 PM

Strainu awarded a token.Jan 13 2022, 7:45 PM

Strainu subscribed.

Stang subscribed.Jan 20 2022, 4:34 PM

In T123178#1951308, @Legoktm wrote:

This is technically possible, yes. But I'm skeptical, because AIUI ORES is currently set up to process edits *after* they're saved, how would it work pre-save? Use PST wikitext? HTML? What about content that is vary-revision?

I take this back. Something would be better than nothing.

Strainu mentioned this in T299436: How impactful would pre-save automoderation be on edit save times?.Feb 19 2022, 3:19 PM

DannyS712 subscribed.Feb 19 2022, 8:13 PM

kostajh mentioned this in T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.Jan 31 2024, 9:59 AM

kostajh subscribed.

[Spike] Investigate building a hook for abuse filterOpen, LowestPublicActions

Description

Related Objects

Event Timeline

[Spike] Investigate building a hook for abuse filter
Open, LowestPublic
Actions