Maniphest T197012

Enable srwiki edit quality filters in RecentChanges
Closed, Resolved · Public

Assigned To

	Catrope

Authored By

	awight
	Jun 12 2018, 3:20 PM

Description

The srwiki advanced edit quality models are deployed and ready for RC integration.

Details

	Subject	Repo	Branch	Lines +/-
	Enable ORES edit quality filters on srwiki (damaging only)	operations/mediawiki-config	master	+18 -0


Related Objects

Status	Assigned	Task
Resolved	Catrope	T197012 Enable srwiki edit quality filters in RecentChanges
Resolved	awight	T194745 Train / test reverted model for srwiki
Resolved	Ladsgroup	T174687 Add language support for Serbian
Resolved	Halfak	T199355 Investigate srwiki goodfaith model, why is it so bad?
Resolved	None	T220556 New labeling campaign for srwiki

Event Timeline

awight created this task. Jun 12 2018, 3:20 PM

Restricted Application added subscribers: • Petar.petkovic, Aklapper. Jun 12 2018, 3:20 PM

awight added a subtask: T194745: Train / test reverted model for srwiki. Jun 12 2018, 8:08 PM

The model is now deployed. @Acamicamacaraca has been asking (at T174687) when we can get the filters enabled.

Halfak renamed this task from Enable srwiki edit quality features to Enable srwiki edit quality filters in RecentChanges. Jun 12 2018, 8:46 PM

Halfak unsubscribed.

Kizule subscribed. Jun 12 2018, 9:44 PM

JTannerWMF assigned this task to Catrope. Jun 13 2018, 2:37 PM

JTannerWMF moved this task from Untriaged to Ready for Pickup on the Collaboration-Team-Triage (Collab-Team-This-Quarter) board.

So, this is the final task. I hope the Collaboration Team will enable the filters soon.

Ladsgroup moved this task from Unsorted to Backlog/Lift Wing on the Machine-Learning-Team board. Jun 20 2018, 5:28 AM

Aca awarded a token. Jun 21 2018, 4:26 PM

Kizule awarded a token. Jun 25 2018, 6:34 AM

@Catrope How's it going? xD

• Vvjjkkii renamed this task from Enable srwiki edit quality filters in RecentChanges to r6aaaaaaaa. Jul 1 2018, 1:04 AM

• Vvjjkkii removed Catrope as the assignee of this task.

• Vvjjkkii triaged this task as High priority.

• Vvjjkkii added projects: CheckUser, Connected-Open-Heritage-Batch-uploads (RAÄ-KMB_1_2017-02), Tamil-Sites, Gamepress, Hashtags, Jade, KartoEditor, Language-2018-Apr-June, New-Editor-Experiences, Mail, TCB-Team (now WMDE-TechWish).

• Vvjjkkii updated the task description.

• Vvjjkkii removed a subscriber: Aklapper.

• Vvjjkkii reopened subtask T194745: Train / test reverted model for srwiki as Open. Jul 1 2018, 1:10 AM

CommunityTechBot renamed this task from r6aaaaaaaa to Enable srwiki edit quality filters in RecentChanges. Jul 2 2018, 2:04 PM

CommunityTechBot assigned this task to Catrope.

CommunityTechBot raised the priority of this task from High to Needs Triage.

CommunityTechBot updated the task description.

CommunityTechBot removed projects: TCB-Team (now WMDE-TechWish), Mail, New-Editor-Experiences, Language-2018-Apr-June, KartoEditor, Jade, Hashtags, Gamepress, Tamil-Sites, Connected-Open-Heritage-Batch-uploads (RAÄ-KMB_1_2017-02), CheckUser.

CommunityTechBot added a subscriber: Aklapper.

CommunityTechBot closed subtask T194745: Train / test reverted model for srwiki as Resolved. Jul 2 2018, 4:10 PM

Catrope merged a task: T195870: Enable ORES filters on srwiki. Jul 4 2018, 1:01 AM

Catrope added a subscriber: Halfak.

In T197012#4319768, @Acamicamacaraca wrote:

@Catrope How's it going? xD

Sorry for the delay, this caught me right in the middle of a busy time.

I looked at the properties of the srwiki models, and while the damaging model is usable, the goodfaith model is not. The highest precision for bad faith that the goodfaith model can achieve is 23.1% (see queries for >=0.23 and >=0.24). That means we could implement a "may be bad faith" filter (which would have 16.8% precision at 62.5% recall), but not a "likely bad faith" or "very likely bad faith" filter, because we want those to have a precision of at least 45% and 60% respectively, and ideally 60% and 90%.
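For anyone who wants to reproduce the two threshold queries mentioned above, here is a minimal sketch of how such a query URL could be built. The base URL, the `model_info` parameter, and the "maximum recall @ precision >= X" optimization string reflect my understanding of the ORES v3 API and are assumptions, not something copied from this task; `threshold_query_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Assumed ORES v3 scores endpoint; not taken from this task.
ORES_BASE = "https://ores.wikimedia.org/v3/scores"


def threshold_query_url(wiki, model, label, min_precision):
    """Build a URL asking for the threshold that maximizes recall
    while keeping precision at or above min_precision.

    label is the target class as a string: "false" selects the
    bad-faith side of the goodfaith model, "true" the good-faith side.
    """
    optimization = f'"maximum recall @ precision >= {min_precision}"'
    model_info = f"statistics.thresholds.{label}.{optimization}"
    query = urlencode({"models": model, "model_info": model_info})
    return f"{ORES_BASE}/{wiki}/?{query}"


# The two queries referred to in the comment above.
url_023 = threshold_query_url("srwiki", "goodfaith", "false", 0.23)
url_024 = threshold_query_url("srwiki", "goodfaith", "false", 0.24)
```

If the API behaves as assumed, the first URL should return threshold statistics and the second should return nothing, which is how a precision ceiling such as 23.1% would show up.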

The damaging model is adequate though; it's not the best model we have, but it's workable. We could configure the following filters:

Very likely good: 99.5% precision at 100% (?!) recall, or alternatively 100% precision at 90.7% recall
May be bad: 15.5% precision at 90.1% recall (we aim for 90% recall or 15% precision, so this fits that perfectly)
Likely bad: 45.7% precision at 39.9% recall (normally we aim for 60% precision, but 45% is fine for lower-fit models)
Very likely bad: 75% precision at 17.5% recall (normally we aim for 90% precision, but that would lead to 5.7% recall which I think is too low)
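To make the reasoning above easier to check, here is a small sketch that grades a candidate filter's precision against the targets quoted in this comment (at least 45% and ideally 60% for "likely bad", at least 60% and ideally 90% for "very likely bad"). The function name `grade_precision` and the labels "ideal", "acceptable", and "unusable" are made up for illustration, and applying the same floors to both the damaging and goodfaith filters is my assumption.

```python
# (minimum, ideal) precision targets per filter, as quoted in the comment.
PRECISION_TARGETS = {
    "likely bad": (0.45, 0.60),
    "very likely bad": (0.60, 0.90),
}


def grade_precision(filter_name, precision):
    """Classify a candidate filter by how its precision compares to targets."""
    minimum, ideal = PRECISION_TARGETS[filter_name]
    if precision >= ideal:
        return "ideal"
    if precision >= minimum:
        return "acceptable"
    return "unusable"


# Damaging model candidates from the list above.
damaging_likely = grade_precision("likely bad", 0.457)
damaging_very_likely = grade_precision("very likely bad", 0.75)

# Best precision the goodfaith model reaches for bad faith (23.1%).
goodfaith_likely = grade_precision("likely bad", 0.231)
```

Under these targets both damaging filters come out as "acceptable", while the goodfaith model cannot support even the "likely bad faith" filter.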

@awight The last time we ran into this situation, on T192498: Deploy ORES advanced editquality models to arwiki, I ended up deploying only the damaging model but not the goodfaith model, and you said that we should consider not deploying (or undeploying) low quality models. How do you feel about this case?

Change 444018 had a related patch set uploaded (by Catrope; owner: Catrope):
[operations/mediawiki-config@master] Enable ORES edit quality filters on srwiki (damaging only)

https://gerrit.wikimedia.org/r/444018

gerritbot added a project: Patch-For-Review. Jul 5 2018, 4:49 PM

Catrope edited projects, added Growth-Team (Sprint 0 (Growth Team)); removed Collaboration-Team-Triage (Collab-Team-This-Quarter). Jul 5 2018, 6:00 PM

Catrope moved this task from Incoming to In Progress on the Growth-Team (Sprint 0 (Growth Team)) board. Jul 5 2018, 6:04 PM

If there are no objections, I'm going to deploy this on Monday July 9th at 18:00-19:00 UTC. (cc @Acamicamacaraca )

@Catrope What do you think about restarting the edit review? Maybe we can get better-quality filters on a second try.

Is that possible?

Change 444018 merged by jenkins-bot:
[operations/mediawiki-config@master] Enable ORES edit quality filters on srwiki (damaging only)

https://gerrit.wikimedia.org/r/444018

Mentioned in SAL (#wikimedia-operations) [2018-07-09T19:01:20Z] <catrope@deploy1001> Synchronized wmf-config/InitialiseSettings.php: Enable ORES damaging filter on srwiki (T197012) (duration: 00m 50s)

• jmatazzoni unsubscribed. Jul 10 2018, 1:11 AM

Thank you very much! Works correctly for now!

In T197012#4407757, @Acamicamacaraca wrote:

@Catrope What do you think about restarting the edit review? Maybe we can get better-quality filters on a second try.

That's a question for @awight and @Halfak , they're the ORES experts. I'm just the Recent Changes guy :)

We can definitely do a second round of labels. I'd like to have @notconfusing look for any anomalies in the labeled data though so we can see if there are inconsistencies. I would expect that the goodfaith classifier would work better!

I just wanted to post on this task to clarify its status. Given that the Growth team has enabled the filters, and the remainder of the conversation is about improvements to the models, I'm going to resolve this ticket. @Halfak, is there a separate task where it could be good to have that conversation?

awight mentioned this in T199355: Investigate srwiki goodfaith model, why is it so bad? Jul 11 2018, 6:18 PM

@awight created T199355 to look into the model itself. Thanks!

• Petar.petkovic removed a project: Patch-For-Review. Jul 11 2018, 10:26 PM

Liuxinyu970226 added a project: Serbian-Sites. Jan 2 2019, 3:13 PM

Halfak closed subtask T199355: Investigate srwiki goodfaith model, why is it so bad? as Resolved. Jun 18 2019, 1:39 PM

Aca moved this task from Backlog to Closed on the Serbian-Sites board. Aug 3 2021, 10:44 AM