Turns out that cuts out only about 500 edits -- from 10727 to 10214. That's still a lot of edits to label. We want to get this down to about 5k at the most. I'll try cutting the edit-count threshold for "trusted edits" down to 200.

May 9 2019, 4:08 PM · User-Ladsgroup, Machine-Learning-Team (Active Tasks), Spanish-Sites, editquality-modeling, Wikilabels, artificial-intelligence

Halfak added a comment to T209670: Create editquality campaign for Spanish Wikiversity.

OK let me re-run. We were already considering admins "trusted" but I'll see how much of a difference it makes to include your edits.

May 9 2019, 2:29 PM · User-Ladsgroup, Machine-Learning-Team (Active Tasks), Spanish-Sites, editquality-modeling, Wikilabels, artificial-intelligence

Halfak added a comment to T209670: Create editquality campaign for Spanish Wikiversity.

@Lsanabria, I'm still waiting on your response to my last questions. No rush. Just want to make sure you know I'm blocked on you taking a look.

May 9 2019, 1:59 PM · User-Ladsgroup, Machine-Learning-Team (Active Tasks), Spanish-Sites, editquality-modeling, Wikilabels, artificial-intelligence

May 8 2019

GitHub <noreply@github.com> committed rOWC5260f9bf4ebf: Merge c80a210209e6cabbbbe5a0659f2fbed2e4180d8a into… (authored by Halfak).

Merge c80a210209e6cabbbbe5a0659f2fbed2e4180d8a into…

May 8 2019, 3:36 PM

Halfak added a comment to T215354: Enable ORES RCFilters for German Wikipedia (dewiki).

You can get fitness statistics directly from the service. Here's some relevant statistics for the English Wikipedia models and the German Wikipedia models.
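As a rough sketch of what asking the service for fitness statistics could look like, the snippet below only builds the request URLs and makes no network call. It assumes the ORES v3 scores endpoint and its `model_info=statistics` parameter as they existed at the time; the helper name `fitness_stats_url` and the choice of the `damaging` and `goodfaith` models are illustrative assumptions, not taken from the comment.

```python
from urllib.parse import urlencode

# Assumed base URL of the ORES scoring service at the time of this comment.
ORES_BASE = "https://ores.wikimedia.org/v3/scores"


def fitness_stats_url(wiki, models):
    """Build a URL asking ORES for model fitness statistics (hypothetical helper)."""
    # Multiple models are joined with "|"; urlencode percent-encodes it as %7C.
    query = urlencode({"models": "|".join(models), "model_info": "statistics"})
    return f"{ORES_BASE}/{wiki}/?{query}"


# Print one URL per wiki so the two sets of statistics can be compared.
for wiki in ("enwiki", "dewiki"):
    print(fitness_stats_url(wiki, ["damaging", "goodfaith"]))
```

Fetching each URL with any HTTP client and reading the returned JSON would be the next step; the response layout is not shown here because it is not described in the comment.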

May 8 2019, 2:21 PM · Growth-Team-Filtering, Growth-Team, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters

Halfak added a comment to T202202: Build article quality model for svwiki.

Sorry for the delay. I've been on vacation for almost a week. I'll be looking to get this deployed some time this week and then I'll set you up with a simple gadget that will help you explore the quality of the predictions. I'll ping back here with updates.

May 8 2019, 2:07 PM · User-Sebastian_Berlin-WMSE, Machine-Learning-Team (Active Tasks), WMSE-Development-Support-2019 (Automatic article quality assessment), articlequality-modeling, Wikilabels, artificial-intelligence

May 1 2019

Halfak updated subscribers of T222298: Wikilabels assigns empty worksets.

May 1 2019, 7:17 PM · Wikilabels, Machine-Learning-Team

Halfak created T222298: Wikilabels assigns empty worksets.

May 1 2019, 7:17 PM · Wikilabels, Machine-Learning-Team

Halfak committed rORESDEPLOY52e9759ac46c: Updates ORES to head. Re. T222121.

Updates ORES to head. Re. T222121

May 1 2019, 7:16 PM

Halfak added a comment to T222270: ores-support-checklist is down.

I can run the venv and import "encodings"

May 1 2019, 3:27 PM · User-Ladsgroup, ORES-Support-Checklist, Machine-Learning-Team (Active Tasks)

Halfak added a comment to T222270: ores-support-checklist is down.

I found this at the end of the uwsgi.log:

May 1 2019, 3:26 PM · User-Ladsgroup, ORES-Support-Checklist, Machine-Learning-Team (Active Tasks)

Halfak added a comment to T222271: Document and share operational details of ores-support-checklist.

Looks like I can "become ores-support-checklist"

May 1 2019, 3:21 PM · ORES-Support-Checklist, Machine-Learning-Team

Halfak added a comment to T222271: Document and share operational details of ores-support-checklist.

Found some docs here: https://github.com/wikimedia/ores-support-checklist

May 1 2019, 3:19 PM · ORES-Support-Checklist, Machine-Learning-Team

Halfak created T222271: Document and share operational details of ores-support-checklist.

May 1 2019, 3:18 PM · ORES-Support-Checklist, Machine-Learning-Team

Halfak added a comment to T222270: ores-support-checklist is down.

Marking this high instead of "unbreak" because it's not a critical service and only serves informational needs.

May 1 2019, 3:15 PM · User-Ladsgroup, ORES-Support-Checklist, Machine-Learning-Team (Active Tasks)

Halfak triaged T222270: ores-support-checklist is down as High priority.

May 1 2019, 3:15 PM · User-Ladsgroup, ORES-Support-Checklist, Machine-Learning-Team (Active Tasks)

Halfak created T222270: ores-support-checklist is down.

May 1 2019, 3:14 PM · User-Ladsgroup, ORES-Support-Checklist, Machine-Learning-Team (Active Tasks)

Apr 30 2019

Halfak claimed T199355: Investigate srwiki goodfaith model, why is it so bad?.

Apr 30 2019, 8:57 PM · artificial-intelligence, editquality-modeling, Wikilabels, Machine-Learning-Team (Active Tasks), Serbian-Sites

Halfak added a comment to T199355: Investigate srwiki goodfaith model, why is it so bad?.

In the meantime, I added the campaign here: https://labels.wmflabs.org/ui/srwiki/ Please pick up these edits and re-label them as we were doing in the etherpad. Once we're done with this, we can re-examine the data and update the training/testing set.

Apr 30 2019, 8:57 PM · artificial-intelligence, editquality-modeling, Wikilabels, Machine-Learning-Team (Active Tasks), Serbian-Sites

GitHub <noreply@github.com> committed rOEQ373b4c143ec4: Merge pull request #193 from wikimedia/fiwiki_v2 (authored by Halfak).

Merge pull request #193 from wikimedia/fiwiki_v2

Apr 30 2019, 8:56 PM

Halfak added a comment to T199355: Investigate srwiki goodfaith model, why is it so bad?.

OK it's clear that we would benefit from re-labeling these 500 revisions using Wiki labels. I'm working to get a campaign loaded. I'd like to call it something like "Edit quality (500 edits re-review)". Could someone help me get a Serbian translation of that?

Apr 30 2019, 8:50 PM · artificial-intelligence, editquality-modeling, Wikilabels, Machine-Learning-Team (Active Tasks), Serbian-Sites

GitHub <noreply@github.com> committed rORES7af9f0f2df70: Merge pull request #327 from wikimedia/root_features_fix (authored by Halfak).

Merge pull request #327 from wikimedia/root_features_fix

Apr 30 2019, 8:24 PM

Halfak moved T202202: Build article quality model for svwiki from Parked to Review on the Machine-Learning-Team (Active Tasks) board.

Apr 30 2019, 7:59 PM · User-Sebastian_Berlin-WMSE, Machine-Learning-Team (Active Tasks), WMSE-Development-Support-2019 (Automatic article quality assessment), articlequality-modeling, Wikilabels, artificial-intelligence

GitHub <noreply@github.com> committed rOWCc108737638ae: Merge 918882ec2ed948e24247ca4dadc892fc06e309df into… (authored by Halfak).

Merge 918882ec2ed948e24247ca4dadc892fc06e309df into…

Apr 30 2019, 5:22 PM

Halfak committed rOWCb84de7cbb90b: Adds Makefile stuff for svwiki model.

Adds Makefile stuff for svwiki model.

Apr 30 2019, 5:22 PM

Halfak committed rOWC2066b1bc6ed7: Adds svwiki feature_list.

Adds svwiki feature_list

Apr 30 2019, 5:22 PM

Aaron Halfaker <ahalfaker@wikimedia.org> committed rOWC49f90aa1e60f: Adds Makefile stuff for svwiki model. (authored by Halfak).

Adds Makefile stuff for svwiki model.

Apr 30 2019, 5:22 PM

GitHub <noreply@github.com> committed rOWC48e9564a9cb8: Merge pull request #81 from gi11es/patch-1 (authored by Halfak).

Merge pull request #81 from gi11es/patch-1

Apr 30 2019, 5:22 PM

Halfak added a comment to T202202: Build article quality model for svwiki.

OK we have a model. Fitness isn't really that great, but it'll be interesting to see how it works in practice.

Apr 30 2019, 5:14 PM · User-Sebastian_Berlin-WMSE, Machine-Learning-Team (Active Tasks), WMSE-Development-Support-2019 (Automatic article quality assessment), articlequality-modeling, Wikilabels, artificial-intelligence

Halfak added a comment to T202202: Build article quality model for svwiki.

Excellent! Thank you.

Apr 30 2019, 2:00 PM · User-Sebastian_Berlin-WMSE, Machine-Learning-Team (Active Tasks), WMSE-Development-Support-2019 (Automatic article quality assessment), articlequality-modeling, Wikilabels, artificial-intelligence

Halfak added a comment to T202202: Build article quality model for svwiki.

No worries. I can work from this. Do you have any datasets extracted that I could work from? Or maybe the extractor is just fast enough to run again.

Apr 30 2019, 1:54 PM · User-Sebastian_Berlin-WMSE, Machine-Learning-Team (Active Tasks), WMSE-Development-Support-2019 (Automatic article quality assessment), articlequality-modeling, Wikilabels, artificial-intelligence

GitHub <noreply@github.com> committed rORES8f224e5c9990: Merge bae07fcd1f75e818f4d7e314753cdbdef2be9128 into… (authored by Halfak).

Merge bae07fcd1f75e818f4d7e314753cdbdef2be9128 into…

Apr 30 2019, 12:06 AM

Halfak committed rORESbae07fcd1f75: Fixes T222121.

Fixes T222121

Apr 30 2019, 12:06 AM

Halfak closed T222121: Non-root features no longer being injected. as Resolved by committing rORESbae07fcd1f75: Fixes T222121.

Apr 30 2019, 12:06 AM · Patch-For-Review, ORES, Machine-Learning-Team (Active Tasks)

Apr 29 2019

Halfak triaged T222121: Non-root features no longer being injected. as High priority.

Apr 29 2019, 11:31 PM · Patch-For-Review, ORES, Machine-Learning-Team (Active Tasks)

Halfak created T222121: Non-root features no longer being injected.

Apr 29 2019, 11:31 PM · Patch-For-Review, ORES, Machine-Learning-Team (Active Tasks)

Halfak added a comment to T202202: Build article quality model for svwiki.

@Gilles, are you still working on this task?

Apr 29 2019, 4:09 PM · User-Sebastian_Berlin-WMSE, Machine-Learning-Team (Active Tasks), WMSE-Development-Support-2019 (Automatic article quality assessment), articlequality-modeling, Wikilabels, artificial-intelligence

Halfak added a subtask for T219238: Remove SpecialContributions::getForm::filters hook call: T117736: Convert Special:Contributions to OOUI.

Apr 29 2019, 4:07 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), Patch-For-Review, MediaWiki-extensions-ORES, Machine-Learning-Team (Active Tasks), User-Jdlrobson, User-Ladsgroup

Halfak added a parent task for T117736: Convert Special:Contributions to OOUI: T219238: Remove SpecialContributions::getForm::filters hook call.

Apr 29 2019, 4:07 PM · User-notice-archive, Web-Team-Backlog (Kanbanana-2019-20-Q2), MediaWiki-Special-pages, User-Jdlrobson, UI-Standardization-Kanban, UI-Standardization

Halfak moved T209670: Create editquality campaign for Spanish Wikiversity from Review to Parked on the Machine-Learning-Team (Active Tasks) board.

Apr 29 2019, 4:04 PM · User-Ladsgroup, Machine-Learning-Team (Active Tasks), Spanish-Sites, editquality-modeling, Wikilabels, artificial-intelligence

Halfak claimed T209670: Create editquality campaign for Spanish Wikiversity.

Apr 29 2019, 4:04 PM · User-Ladsgroup, Machine-Learning-Team (Active Tasks), Spanish-Sites, editquality-modeling, Wikilabels, artificial-intelligence

Halfak moved T212379: Jade Wireframes: Entity view mode from Parked to Review on the Machine-Learning-Team (Active Tasks) board.

Apr 29 2019, 4:03 PM · Machine-Learning-Team (Active Tasks), Design, Jade

Halfak moved T212379: Jade Wireframes: Entity view mode from Review to Parked on the Machine-Learning-Team (Active Tasks) board.

Apr 29 2019, 4:03 PM · Machine-Learning-Team (Active Tasks), Design, Jade

Halfak added a comment to T209670: Create editquality campaign for Spanish Wikiversity.

We can definitely play with the "trusted edits" set. @Lsanabria, are there any user-rights on Spanish Wikiversity that you think might indicate a "trusted" status? Also, if we labeled edits by anyone with over a couple hundred edits as "trusted", do you think that would mostly work out OK? Note that even "trusted" edits get loaded into Wiki Labels for review if they are reverted.
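The selection rule described here can be sketched roughly as follows. This is a minimal illustration, not the actual campaign code: the `Edit` fields, the `needs_review` helper, the treatment of admins as trusted, and the threshold of 200 edits are all assumptions chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class Edit:
    # Hypothetical record layout; the field names are illustrative only.
    rev_id: int
    user_edit_count: int
    user_is_admin: bool
    reverted: bool


def needs_review(edit, trusted_edit_count=200):
    """Return True if the edit should be loaded into Wiki Labels for review."""
    # Assumption: a user right (here, admin) or a high edit count marks "trusted".
    trusted = edit.user_is_admin or edit.user_edit_count > trusted_edit_count
    # Reverted edits are always reviewed, even when the editor is trusted.
    return edit.reverted or not trusted


edits = [
    Edit(1, 5, False, False),     # newcomer, not reverted: review
    Edit(2, 1500, False, False),  # trusted, not reverted: skip
    Edit(3, 1500, False, True),   # trusted but reverted: review
    Edit(4, 10, True, False),     # admin, not reverted: skip
]
to_label = [e.rev_id for e in edits if needs_review(e)]
print(to_label)  # prints [1, 3]
```

Lowering `trusted_edit_count` marks more editors as trusted and so shrinks the set of edits that need labeling.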

Apr 29 2019, 3:03 PM · User-Ladsgroup, Machine-Learning-Team (Active Tasks), Spanish-Sites, editquality-modeling, Wikilabels, artificial-intelligence

Halfak added a comment to T219498: Wikimedia Hackathon 2019 Mentoring Program.

I think mentor matching needs at least half an hour. I'd like to have a bit of buffer time, so an hour would be better if we can swing it.

Apr 29 2019, 2:12 PM · International-Developer-Events, Developer-Advocacy (Apr-Jun 2019), Wikimedia-Hackathon-2019, Wikimedia-Hackathon-2019-Organization

Halfak added a comment to T199355: Investigate srwiki goodfaith model, why is it so bad?.

It looks like a lot of the edits that were labeled "badfaith" but that we have now re-labeled "goodfaith" were saved by @Zoranzoki21. That might be simply because Zoranzoki21 did a lot of labeling work. Would you take a look at them to see if you agree with our re-assessment? Maybe there is some confusion as to the meaning of "goodfaith".

Apr 29 2019, 2:08 PM · artificial-intelligence, editquality-modeling, Wikilabels, Machine-Learning-Team (Active Tasks), Serbian-Sites

Halfak added a comment to T199355: Investigate srwiki goodfaith model, why is it so bad?.

So, as it stands, more than half of the items labeled badfaith are actually goodfaith upon review. I'll look into these labels to see if I can find some sort of consistency in them.

Apr 29 2019, 1:37 PM · artificial-intelligence, editquality-modeling, Wikilabels, Machine-Learning-Team (Active Tasks), Serbian-Sites

Halfak added a comment to T199355: Investigate srwiki goodfaith model, why is it so bad?.

An etherpad is directly editable. You should be able to just type into it.

Apr 29 2019, 1:36 PM · artificial-intelligence, editquality-modeling, Wikilabels, Machine-Learning-Team (Active Tasks), Serbian-Sites

Apr 26 2019

Halfak added a comment to T199355: Investigate srwiki goodfaith model, why is it so bad?.

I just labeled a few. I'm seeing some edits that look like they are goodfaith in this set. I wonder if I am missing something.

Apr 26 2019, 9:32 PM · artificial-intelligence, editquality-modeling, Wikilabels, Machine-Learning-Team (Active Tasks), Serbian-Sites

Halfak added a comment to T199355: Investigate srwiki goodfaith model, why is it so bad?.

I've dumped all of the edits labeled "badfaith" into this etherpad: https://etherpad.wikimedia.org/p/srwiki_badfaith_edits

Apr 26 2019, 9:25 PM · artificial-intelligence, editquality-modeling, Wikilabels, Machine-Learning-Team (Active Tasks), Serbian-Sites

Halfak added a comment to T209670: Create editquality campaign for Spanish Wikiversity.

@Ladsgroup I just pinged you in the task because it looks like the data is a little weird and I had some questions about it a few weeks ago that look like they are still unanswered.

Apr 26 2019, 8:10 PM · User-Ladsgroup, Machine-Learning-Team (Active Tasks), Spanish-Sites, editquality-modeling, Wikilabels, artificial-intelligence

Halfak closed T201047: Use Joblib for ORES model serialization as Declined.

Apr 26 2019, 8:05 PM · Machine-Learning-Team (Active Tasks), ORES

Halfak moved T200365: Explore alternative names for Jade data from Review to Completed on the Machine-Learning-Team (Active Tasks) board.

Apr 26 2019, 8:04 PM · Machine-Learning-Team (Active Tasks), Jade

Halfak moved T221618: ores icinga check for grafana alert returns 404 from Review to Completed on the Machine-Learning-Team (Active Tasks) board.

Apr 26 2019, 8:04 PM · User-Ladsgroup, Machine-Learning-Team (Active Tasks), ORES

Halfak moved T221780: Generate datasets for sociative (vital 10k and some wikiprojects) from Review to Completed on the Machine-Learning-Team (Active Tasks) board.

Apr 26 2019, 8:04 PM · Machine-Learning-Team (Active Tasks)

Apr 25 2019

Halfak added a comment to T215354: Enable ORES RCFilters for German Wikipedia (dewiki).

@JTannerWMF, per T164331: Define a process for adding ORES filters to new wikis when ORES is enabled on those wikis, I believe we'd decided that it is the #Growth team's responsibility to get buy-in for enabling these features since you're maintaining the UI. I'd suggest reaching out to the Wikipedians who did the most labeling work.

Apr 25 2019, 7:48 PM · Growth-Team-Filtering, Growth-Team, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters

Halfak created T221886: Add links to wiki documentation on PAWS index page.

Apr 25 2019, 7:34 PM · PAWS

Halfak updated subscribers of T221870: Why are there three Q-marks (???) in threshholds in Special:ORESModels?.

In conversation with @Ladsgroup, we couldn't figure out whether unused thresholds should be invisible (e.g. https://he.wikipedia.org/wiki/%D7%9E%D7%99%D7%95%D7%97%D7%93:ORESModels )

Apr 25 2019, 5:06 PM · Growth-Team-Filtering, Growth-Team, Machine-Learning-Team, ORES, MediaWiki-extensions-ORES

Halfak added a project to T221870: Why are there three Q-marks (???) in threshholds in Special:ORESModels?: Growth-Team.

Apr 25 2019, 4:22 PM · Growth-Team-Filtering, Growth-Team, Machine-Learning-Team, ORES, MediaWiki-extensions-ORES

Halfak added a project to T221871: Non-overlapping threshholds in ORESModels on lvwiki: Growth-Team.

Apr 25 2019, 4:04 PM · Growth-Team (Sprint 0 (Growth Team)), ORES, Machine-Learning-Team, MediaWiki-extensions-ORES

Apr 24 2019

Halfak added a comment to T219547: Wikimedia Hackathon 2019: Design a flowchart.

I'd prefer PNG over JPG if SVG or another vector format is not an option. No one wants those JPG artifacts messing with your nice, crisp design. :D

Apr 24 2019, 3:28 PM · International-Developer-Events, MoveComms-Support (Apr-Jun-2019), Wikimedia-Hackathon-2019-Organization, CommRel-Design

Halfak added a comment to T221780: Generate datasets for sociative (vital 10k and some wikiprojects).

Here are titles for African Diaspora (Hard mode): https://quarry.wmflabs.org/run/366495/output/0/json
Here are titles for Women Scientists (Probably less hard): https://quarry.wmflabs.org/run/366489/output/0/json

Apr 24 2019, 3:07 PM · Machine-Learning-Team (Active Tasks)

Halfak added a comment to T221780: Generate datasets for sociative (vital 10k and some wikiprojects).

I just updated https://github.com/halfak/taxonomy_examples with a new dataset called "vital_10k_taxonomy.json". I'll be working on getting another dataset with pages that fall into a specific topic cross-section next.

Apr 24 2019, 3:06 PM · Machine-Learning-Team (Active Tasks)

Halfak created T221780: Generate datasets for sociative (vital 10k and some wikiprojects).

Apr 24 2019, 3:06 PM · Machine-Learning-Team (Active Tasks)

Halfak added a comment to T120170: [Epic] Paid editing (COI) detection model.

(EC) @JEumerus/@Thryduulf, false positives/negatives don't come into play until there is a model. We'll certainly be looking at fitness statistics and manually reviewing false positives once that model is first built. This has been the pattern for vandalism fighting ORES models and I think it is wholly appropriate for this modeling work as well. Once we know what the model is able to detect and what it can be useful for, then we can discuss "tools" and use cases.

Apr 24 2019, 2:16 PM · Machine-Learning-Team, research-ideas, artificial-intelligence

Halfak added a comment to T167608: Add caused_by_user_text to mediawiki_page_history.

Hey folks. I've been following this task, but I might not have the full context, so take what I say with a grain of salt that is appropriately sized.

Apr 24 2019, 2:01 PM · Analytics-Kanban, Analytics

Apr 23 2019

Halfak added a comment to T189569: Avoid contradicting messages between Getting Started and Visual editor for new users.

Wow! There shouldn't be. That was so long ago. Like, 5 years! I'm not sure how I would check. If there was one running, it would have split newcomer groups by their user_id. E.g. odd user_ids would have been bucketed in experimental and even user_ids would have been bucketed in control (or vice versa).
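The parity-based split described above can be sketched in a few lines. This is only an illustration of the idea: the function name `bucket_for` and the `odd_is_experimental` flag are hypothetical, and the flag exists precisely because the comment does not say which parity mapped to which group.

```python
def bucket_for(user_id, odd_is_experimental=True):
    """Assign a newcomer to a bucket from the parity of their user_id."""
    is_odd = user_id % 2 == 1
    # When the flag is False the mapping flips: even ids become experimental.
    if is_odd == odd_is_experimental:
        return "experimental"
    return "control"


for uid in (1001, 1002, 1003):
    print(uid, bucket_for(uid))
```

Because the assignment depends only on the stored user_id, it can be recomputed years later for any account without a separate record of who was in which group.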

Apr 23 2019, 9:13 PM · Growth-Team-Filtering, Growth-Team, Growth Design, New-Editor-Experiences, MediaWiki-extensions-GettingStarted

Halfak updated subscribers of T221696: ORESModels has a fatal error.

@Catrope, I figured you might know what's up.

Apr 23 2019, 8:38 PM · MW-1.34-notes (1.34.0-wmf.1; 2019-04-16), Wikimedia-production-error, MediaWiki-extensions-ORES, Machine-Learning-Team

Halfak moved T221696: ORESModels has a fatal error from Unsorted to Maintenance/cleanup on the Machine-Learning-Team board.

Apr 23 2019, 8:19 PM · MW-1.34-notes (1.34.0-wmf.1; 2019-04-16), Wikimedia-production-error, MediaWiki-extensions-ORES, Machine-Learning-Team

Halfak triaged T221696: ORESModels has a fatal error as High priority.

Apr 23 2019, 8:19 PM · MW-1.34-notes (1.34.0-wmf.1; 2019-04-16), Wikimedia-production-error, MediaWiki-extensions-ORES, Machine-Learning-Team

Halfak created T221696: ORESModels has a fatal error.

Apr 23 2019, 8:19 PM · MW-1.34-notes (1.34.0-wmf.1; 2019-04-16), Wikimedia-production-error, MediaWiki-extensions-ORES, Machine-Learning-Team

GitHub <noreply@github.com> committed rOEQ73bab7bdd0ef: Merge pull request #193 from wikimedia/fiwiki_v2 (authored by Halfak).

Merge pull request #193 from wikimedia/fiwiki_v2