As identified in the Automoderator pilot metrics (T362610), Automoderator (AM) handles less than 1% of the total revert workload on Turkish Wikipedia (weekly average since deployment). The team is planning to increase the coverage.
To that end, it would be helpful to understand more about the reverts not handled by AM while it is enabled. Answers to the following questions would help the team make a decision:
- Are there any edits scoring above the currently set 0.99 threshold that AM did not revert? If yes, what are they?
- What is the average risk score of edits reverted by users but not Automoderator?
- Count of reverts not handled by AM at various risk thresholds (0.985, 0.98, 0.975, 0.97, 0.95, 0.9, 0.85, 0.8, 0.75)
- What proportion of those reverts were reverted back? (potential false positives)
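
The questions above could be answered with a small aggregation over the revert data. The following is only a rough sketch, not the team's actual query: the record fields (`rev_id`, `risk_score`, `reverted_by_am`, `revert_was_reverted`) and the function name are hypothetical, and it assumes "above the threshold" means a score greater than or equal to it.

```python
from statistics import mean

# Thresholds listed in the task; 0.99 is the currently configured one.
THRESHOLDS = [0.985, 0.98, 0.975, 0.97, 0.95, 0.9, 0.85, 0.8, 0.75]
CURRENT_THRESHOLD = 0.99


def summarize_missed_reverts(reverts, thresholds=THRESHOLDS,
                             current=CURRENT_THRESHOLD):
    """Summarize reverts that were made by users rather than by AM.

    `reverts` is a list of dicts with hypothetical keys:
      rev_id              - ID of the edit that was reverted
      risk_score          - model risk score of that edit (0..1)
      reverted_by_am      - True if Automoderator performed the revert
      revert_was_reverted - True if the revert itself was later undone
    """
    # Keep only reverts performed by users, not by Automoderator.
    human = [r for r in reverts if not r["reverted_by_am"]]

    # Q1: edits at or above the current threshold that AM did not revert.
    missed_above_current = [
        r["rev_id"] for r in human if r["risk_score"] >= current
    ]

    # Q2: average risk score of edits reverted by users.
    avg_score = mean(r["risk_score"] for r in human) if human else None

    # Q3 and Q4: count per threshold, and the share of those reverts that
    # were themselves reverted back (potential false positives).
    by_threshold = {}
    for t in thresholds:
        bucket = [r for r in human if r["risk_score"] >= t]
        undone = sum(1 for r in bucket if r["revert_was_reverted"])
        by_threshold[t] = {
            "count": len(bucket),
            "reverted_back_share": undone / len(bucket) if bucket else None,
        }

    return {
        "missed_above_current": missed_above_current,
        "avg_risk_score": avg_score,
        "by_threshold": by_threshold,
    }
```

Note that the per-threshold counts are cumulative: a revert with a score of 0.98 is counted at 0.98 and at every lower threshold, which matches the question of how much extra workload AM would pick up if the threshold were lowered to that value.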