AbuseFilter: Indicate that an edit was a revert
Open, Stalled, Needs TriagePublic
Actions

Assigned To

None

Authored By

	matej_suchanek
	Mar 6 2017, 5:25 PM

Description

After all core hooks have been updated, it should be easy to add a boolean variable that indicates whether an edit was reverted. So instead of checking summary for specific substrings, the code will simplify to &! is_revert (or so).

Related Objects
Search...

Status	Assigned	Task
Stalled	None	T159725 AbuseFilter: Indicate that an edit was a revert
Resolved	Ostrzyciel	T152434 Add method to Revision to check if it was a Revert, and whether an edit was Reverted
Resolved	Ostrzyciel	T216297 Develop method for identifying reverts in EventBus data
Resolved	Ostrzyciel	T254074 Implement the reverted edit tag
Resolved	Ostrzyciel	T252366 Choose the definition of a "reverted" edit
Resolved	Ostrzyciel	T256001 Detect manual reverts
Resolved	Ostrzyciel	T258951 RevisionStore: add getRevisionIdsBetween method
Resolved	Ostrzyciel	T258952 PageUpdater: implement EditResult class
Resolved	Ostrzyciel	T257215 Deprecate RollbackComplete hook
Resolved	Ostrzyciel	T256915 EditResult's revert is not marked correctly when using undoafter param
Resolved	Ostrzyciel	T259732 EditResult: enable serialization
Resolved	Ostrzyciel	T259014 Protect the reverted edits feature from abuse
Resolved	Ostrzyciel	T259103 Run reverted tag update job only after the edit is approved
Resolved	Ostrzyciel	T259733 PageUpdater: save additional info about reverts in ct_params
Resolved	Ostrzyciel	T260524 Implement BeforeRevertedTagUpdate hook in FlaggedRevs

Event Timeline

matej_suchanek created this task.Mar 6 2017, 5:25 PM

Restricted Application added subscribers: TerraCodes, Aklapper. · View Herald TranscriptMar 6 2017, 5:25 PM

matej_suchanek changed the task status from Open to Stalled.Mar 6 2017, 5:26 PM

matej_suchanek moved this task from To Triage to Not ready to announce on the User-notice board.

matej_suchanek moved this task from Backlog to Filtering features on the AbuseFilter board.

@matej_suchanek: good first task tasks are self-contained, non-controversial issues with a clear approach and should be well-described with pointers to help the new contributor. Given the current short task description I'm removing the good first task tag. Please re-add the tag once the task description has been polished and provides sufficient information for a new contributor. Thanks!

Jdlrobson added a parent task: T152434: Add method to Revision to check if it was a Revert, and whether an edit was Reverted.Apr 3 2017, 7:46 PM

• Mattflaschen-WMF renamed this task from Indicate that an edit was a revert to AbuseFilter: Indicate that an edit was a revert.Aug 23 2017, 8:53 PM

• Mattflaschen-WMF removed a parent task: T152434: Add method to Revision to check if it was a Revert, and whether an edit was Reverted.

• Mattflaschen-WMF added a subtask: T152434: Add method to Revision to check if it was a Revert, and whether an edit was Reverted.

I'm reopening this, as significant progress has been made to make this possible.

PageUpdater now constructs an EditResult object when saving the new revision and makes it available through the PageSaveComplete hook. The object provides all necessary information to determine whether the edit was a revert and much more, there's tons of potential for AbuseFilter. The problem is – this hook is called after the revision is saved, and the call is made by PageUpdater, not EditPage, which belongs to a different abstraction layer. It may be possible to pass the EditResult before saving the revision (and thus allow for aborting it) using the MultiContentSave hook, but that would still be done from within PageUpdater.

My main problem here is that EditPage and WikiPage are a tangled mess and it's hard for me to propose something sensible. If EditPage controlled PageUpdater directly, without using WikiPage's doEditContent (which is deprecated BTW), it may be possible to split PageUpdater::saveRevision into two parts:

Prepare the edit to be saved and construct the EditResult.
Save the edit.

In such a setup we would be able to call AbuseFilter between these methods and pass EditResult to it properly. And from the right abstraction layer.

Idk, this is my best idea on how to solve this. Someone may come up with something better :)

Briilliant idea!

I think we should untangle the EditPage mess first (remove the use of deprecated doEditContent, then split it into two parts as you explained). It might be better to have a subtask for it.

@Daimona before committing to any work, I wanted to invite you to take a look here. Could this fall within the scope of the grant you got funded for by WMF?

Untangling that mess would be nice, but it doesn't seem to be easy, see T208801: Support slots other than the main slot in EditPage - backend support for a 2018 attempt on doing that.
I think @DannyS712 is also attempting to take this on in this WIP patch.

Doing a big refactor of EditPage first would be probably more sensible than trying to mess around with the current implementation. We'd just have to ensure doing something like described above would be possible.

Oh, now I got another idea. We could also move the AbuseFilter to the storage layer and have PageUpdater return AbuseFilter-induced errors in its Status object. That would work as well, but I have no idea how much refactoring that would require. Probably a lot.

In T159725#6324363, @Ostrzyciel wrote:

Untangling that mess would be nice, but it doesn't seem to be easy, see T208801: Support slots other than the main slot in EditPage - backend support for a 2018 attempt on doing that.
I think @DannyS712 is also attempting to take this on in this WIP patch.

Doing a big refactor of EditPage first would be probably more sensible than trying to mess around with the current implementation. We'd just have to ensure doing something like described above would be possible.

That WIP patch you linked to is intended to be a part of T157658: Factor out a backend from EditPage and should not change the interface for extensions to prevent or modify an edit via the EditFilterMergedContent hook. In short, it moves away from EditPage::internalAttemptSave to a dedicated backend for saving new edits, along with all of the possible reasons edits would be perverted (hooks, edit conflict, page doesn't exist, etc)

In T159725#6324301, @Huji wrote:

I think we should untangle the EditPage mess first (remove the use of deprecated doEditContent, then split it into two parts as you explained). It might be better to have a subtask for it.

I believe this is a great idea.

@Daimona before committing to any work, I wanted to invite you to take a look here. Could this fall within the scope of the grant you got funded for by WMF?

Well, this is a good example of task that is trivial to implement on the AF side, but hard overall due to core limitations. If the behaviour changes in core, then it would be in scope, but it would also be a trivial task that could be addressed outside of the grant. OTOH, changing MW core is probably out of scope (especially for a change this big).

Daimona mentioned this in T262157: Make rollbacks go through AbuseFilter.Sep 6 2020, 11:42 AM

I think it should always be possible to use $this->undidRev in EditPage::runPostMergeFilters. However, that only represents the revID in the URL (undo=xxx). EditPage checks whether the revert is clean (see EditPage::isUndoClean), but that happens *after* running the EditFilterMergedContent hook. I'm not comfortable moving that code around, given that we're talking about a 500 lines long method (EditPage::internalAttemptSave). Also, it would not distinguish manual revert vs undo button, but I don't think that's important.

Ostrzyciel closed subtask T152434: Add method to Revision to check if it was a Revert, and whether an edit was Reverted as Resolved.Sep 15 2020, 7:29 AM

Jules78120 awarded a token.Sep 2 2021, 9:48 AM

Jules78120 subscribed.

This is much needed: on fr-wp, for example, we have a couple of banned people harassing volunteers by reverting their edits. And there is no efficient and clean way to handle that currently with AF.

In T159725#7330472, @Jules78120 wrote:

And there is no efficient and clean way to handle that currently with AF.

That's because MediaWiki core runs abuse filters (and similar things like SpamBlacklist) before determining whether the edit is a revert. That is, by the time filters are executed, nobody knows whether the current edit is a revert. Until this is changed in MW core, there's nothing we can do on the AF side.

As mentioned above, the relevant core code really is a mess, and even if there've been several improvements recently, I think it's still far from what we'd need to complete this task. I'm not even sure which task(s) should I block this task on, if any.

AbuseFilter: Indicate that an edit was a revertOpen, Stalled, Needs TriagePublicActions

Description

Related ObjectsSearch...

Event Timeline

AbuseFilter: Indicate that an edit was a revert
Open, Stalled, Needs TriagePublic
Actions

Related Objects
Search...