Description
Details
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
Enable ORES filters on ukwiki | operations/mediawiki-config | master | +16 -0 | |
Alphabetize ORES settings | operations/mediawiki-config | master | +261 -261 |
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | None | T130213 [Epic] Edit quality models (damaging/goodfaith) | |||
Duplicate | None | T130294 Deploy edit quality models for ukwiki | |||
Resolved | Tgr | T256887 Enable ORES filters for ukwiki (Ukrainian Wikipedia) |
Event Timeline
@Halfak Hi, is it possible to enable ORES filters (at least as a beta feature for ukwiki) now? What's left? Do you wait for https://phabricator.wikimedia.org/T251571 ?
I don't think anything is blocking the deployment of the filters. This should be on the Growth-Team backlog. @MMiller_WMF, it looks like the UK Wikipedians have been waiting a while for the filters to be turned on in recent changes.
@Halfak -- thanks for bringing this up. I talked about this with @Trizek-WMF, and we're planning to get it on our schedule next week. I had it cooking on our to-do list, but forgot to update you.
Note: the team discussed this today, and needs to return to it next week. We have a full plate this week.
This confused me for a while, but I think I found an OK configuration. The stats are a bit strange though - for other wikis that I have seen precision and recall are more or less an "X" shape for the "bad" outcomes, while here especially for goodfaith precision is not even remotely monotonic, and it's just not possible to reach better precision than 0.6. Is that legit?
damaging=false | damaging = true | goodfaith = false | goodfaith = true |
Anyway the numbers I ended up with:
model | filter | min | max | condition | precision | recall |
damaging | likelygood | 0 | 0.147 | maximum recall @ precision >= 0.997 | 0.997 | 0.899 |
damaging | maybebad | 0.122 | 1 | maximum filter_rate @ recall >= 0.9 (default) | 0.161 | 0.903 |
damaging | likelybad | 0.745 | 1 | maximum recall @ precision >= 0.45 | 0.451 | 0.258 |
damaging | verylikelybad | |||||
goodfaith | likelygood | 0.944 | 1 | maximum recall @ precision >= 0.999 | 0.999 | 0.88 |
goodfaith | maybebad | 0 | 0.777 | maximum recall @ precision >= 0.15 | 0.15 | 0.74 |
goodfaith | likelybad | 0 | 0.301 | maximum recall @ precision >= 0.45 | 0.451 | 0.246 |
goodfaith | verylikelybad | |||||
Damaging verylikelybad was dropped because it would need a precision of ~0.55 to get recall above 0.1, and the guide says we should aim for high precision.
Goodfaith verylikelybad was dropped because precision levels above 0.6 are completely impossible and recall >= 0.1 would take something like 0.48 precision.
Change 655301 had a related patch set uploaded (by Gergő Tisza; owner: Gergő Tisza):
[operations/mediawiki-config@master] Alphabetize ORES settings
Change 655302 had a related patch set uploaded (by Gergő Tisza; owner: Gergő Tisza):
[operations/mediawiki-config@master] Enable ORES filters on ukwiki
I think you've interpreted these graphs correctly, and it means the goodfaith model for this wiki just isn't very good. Unfortunately this is common, especially in cases where bad faith edits are rare in the labeling data.
Your numbers look good to me. You're right that we shouldn't offer verylikelybad filters for either model, because the models don't perform well enough for that. The other filters are set well and behave as expected. The recall for the likelybad filters is low, but that's what happens with poor models like these.
Thanks for checking, @Catrope!
Scheduling deployment to the Tuesday SF morning time slot.
Change 655301 merged by jenkins-bot:
[operations/mediawiki-config@master] Alphabetize ORES settings
Change 655302 merged by jenkins-bot:
[operations/mediawiki-config@master] Enable ORES filters on ukwiki
Mentioned in SAL (#wikimedia-operations) [2021-01-12T19:46:48Z] <tgr@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:655302|Enable ORES filters on ukwiki (T256887)]] (duration: 01m 05s)
Mentioned in SAL (#wikimedia-operations) [2021-01-12T19:48:22Z] <tgr_> synced Config: [[gerrit:655301|Alphabetize ORES settings (T256887)]]
@Ata, @Base, @Trizek-WMF This is now enabled on ukwiki. It might take a while to process all old revision, but for fresh changes you should see ORES filters working. Please report if they are behaving unreasonably.
Checked and moved to PM column for @MMiller_WMF FYI.
- The filters are present (three filters for each model)
- the translation seems to be in place
- the stats on Special:ORESModels looks reasonable
Thank you! I have a general question: if I understand correctly, I am not really aware of the terminology here, you mention above that the model for ukwiki is not as good as it is for other wikis. I assume this will have some consequences for the usability of the filters? Is it immutable or there are ways to improve the model? (Or perhaps it can self improve basing on what is being reverted or some other feed?)
https://www.mediawiki.org/wiki/ORES/Thresholds is a good resource about the terminology.
I assume this will have some consequences for the usability of the filters? Is it immutable or there are ways to improve the model? (Or perhaps it can self improve basing on what is being reverted or some other feed?)
Good question, I'm not sure but I believe you may need to do another labeling campaign (https://www.mediawiki.org/wiki/ORES/Get_support#Advanced_edit_quality_support) to improve the model. Maybe @Halfak knows what the next step is from here.