Question: What proportion of human-edits will need to be reviewed if we want 95% recall?
Methods:
- Gather random sample of edits
- Label reverted
- Explore dataset of non-reverted damage
Question: What proportion of human-edits will need to be reviewed if we want 95% recall?
Methods:
Started some work here. This is based off of a random sample of wikidata edits
https://etherpad.wikimedia.org/p/revscoring_wikidata_reverted_set
OK. If we draw the cutoff at 0.93, we'll catch 100/10000 edits and that will account for (as far as we can tell) all of the vandalism!
I made some analysis and posted it in https://www.wikidata.org/wiki/Wikidata_talk:ORES/Report_mistakes.