Page MenuHomePhabricator

Baseline: Of all vandalism reverts by patrollers, how many are reverted back?
Closed, ResolvedPublic

Description

Research question: What is the efficiency of Automoderator in countering vandalism on wikis?

Evaluation method: While the thresholds for success can vary based on the community, the team would consider the following as successes:

  • Automoderator reverts X% of all actual vandalism [recall: TP/(TP+FN)]
  • Automoderator has a baseline accuracy of 90% when reverting vandalism [precision: TP/(TP+FP)]

Baseline
To find a baseline to compare Automoderator to, we want to see if we can quantify the 'accuracy' of human patrollers. This is going to be hard to get a clear number for, but one tangible figure we want to evaluate here is how many times patrollers are themselves reverted when reverting vandalism. This would imply that they reverted an edit, but another patroller disagreed.


Question: Of all vandalism reverts, how many reverts are reverted by a different patroller?

This analysis should be done individually for the top 10 Wikipedia projects by editing activity: en, es, ja, de, fr, ru, zh, it, pt, fa, plus our pilot id.wiki


Here we will try out our operational definitions:

Vandalism

  • Reverted within 12 hours
  • User edit count less 15 edits
  • Time since user's first edit is less than 48 hours
  • Reverted by a different editor
  • Edit is in the main namespace

Patroller
Users with ...
any of the following user rights:

  • rollback
  • review
  • patrol
  • block
  • delete

Event Timeline

Samwalton9-WMF updated the task description. (Show Details)
Samwalton9-WMF updated the task description. (Show Details)

@Samwalton9-WMF

Results (for 2022)

wiki_dbPercent of Reverts Reverted# Reverts
enwiki3.641628305
eswiki5.25308797
itwiki3.77226128
ruwiki3.22206098
frwiki3.38195289
dewiki1.94170661
jawiki3.4878123
fawiki4.1372164
zhwiki4.8770370
ptwiki3.3635227
idwiki6.534211
  • The notebook has a detailed breakdown by various user group combinations (sysops, non-sysops, users with no extended rights) and also for reverts where the initial edit being reverted was not a revert.

A few notes:

  • Reverts reverted back by anonymous users, users who edit was initially reverted or user has less than 150 edits, they are excluded. I think that aligns with the description, "another patroller disagreed". But for reference, if need be, the notebook has analysis on all reverts as well, in which case the percent of reverts reverted back is around 8-10%.
  • Our definition of patrollers also include users with no extended rights, but having 150+ content edits and 10+ content reverts, however, I couldn't fully apply that. These values are not calculated for mediawiki_history, and for me to calculate on that would be very computationally intensive. For now it is, if a user has 150+ edits overall on a given wiki (at the time of the revert). I will check for ways calculate the required fields as per original definition efficiently.
KCVelaga_WMF renamed this task from Baseline: Of all vandalism reverts, how many reverts are reverted by patrollers? to Baseline: Of all vandalism reverts by patrollers, how many are reverted back?.Jan 2 2024, 9:16 AM
KCVelaga_WMF triaged this task as Medium priority.

So if I'm reading this correctly, we can say that approximately 3-5% of revert decisions made by active editor are contested by another editor.

This is great, thank you!