Halfak (Aaron Halfaker, EpochFail, halfak)
Principal Research Scientist

User Details

User Since
Oct 21 2014, 6:05 PM (361 w, 2 d)
Availability
Available
IRC Nick
halfak
LDAP User
Halfak
MediaWiki User
EpochFail

Hi! I'm a socio-technologist. I do science so that I can build new technologies for social systems.

You can find me as:

Recent Activity

Today

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

^ New version of the model using updated features and manually extracted labels.

Thu, Sep 23, 4:42 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

https://github.com/wikimedia/articlequality/pull/168

Thu, Sep 23, 4:34 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
Halfak committed rOWCed260ae53796: nlwiki features for infobox and list_items (authored by Halfak).
nlwiki features for infobox and list_items
Thu, Sep 23, 4:01 PM
Halfak committed rOWC355af1a42c1a: Adds manually extracted nlwiki labels. (authored by Halfak).
Adds manually extracted nlwiki labels.
Thu, Sep 23, 4:01 PM

Mon, Sep 20

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

I was able to gather 64 new labels from the wiki. Most of them were E class, but we did get some B, C and D -- which are hard to differentiate.

Mon, Sep 20, 5:52 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

Thu, Sep 9

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

Still waiting on a review/merge. In the meantime, @Psingh07 is working on gathering new labeled data from the reviewing work folks did on the wiki pages.

Thu, Sep 9, 4:29 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

Thu, Aug 26

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

Once this is merged, I'll use this and other improvements to re-generate the models. Then we can use those models to consider a new labeling campaign based on the new quality criteria.

Thu, Aug 26, 5:39 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

I added a wikitext.revision.list_items feature to revscoring for tracking articles that are in outline form (as opposed to prose). See https://github.com/wikimedia/revscoring/pull/506

Thu, Aug 26, 5:38 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
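
The outline-versus-prose idea behind the list_items feature in the comment above can be illustrated with a rough sketch. This is not the revscoring implementation (see the linked pull request for that); the helper names and the rule that a list item is any line starting with '*' or '#' are assumptions made for illustration, and the rule would also count lines such as "#REDIRECT".

```python
import re

# Hypothetical helper, not the revscoring implementation: a line is treated
# as a list item when it starts with one or more '*' or '#' characters.
LIST_ITEM_RE = re.compile(r"^[*#]+", re.MULTILINE)


def count_list_items(wikitext: str) -> int:
    """Count lines that begin with wikitext list markup."""
    return len(LIST_ITEM_RE.findall(wikitext))


def list_item_ratio(wikitext: str) -> float:
    """Share of non-empty lines that are list items (0.0 for empty text)."""
    lines = [line for line in wikitext.splitlines() if line.strip()]
    if not lines:
        return 0.0
    return count_list_items(wikitext) / len(lines)
```

A ratio close to 1.0 would suggest an article written mostly as an outline, while a ratio close to 0.0 would suggest mostly prose.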

Jul 30 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

Sorry. One final thought. We could make the quality classes non-ordinal. E.g. call the lowest class Beginnetje and the highest class Etalage, and develop common-sense names for the classes in between. That way, order may be plainly apparent and in-between classes would require a common-sense name as well, rather than something like "B-" or "C+".

Jul 30 2021, 5:15 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

I should say, this pattern of retraining also works for in-between classes.

Jul 30 2021, 5:11 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

It should be OK to change the meaning of the current classes over time too. One nice thing about using an ML model to supplement quality assessment is that it is easy to propagate changes like that. E.g. if we adjust the definition of a quality class, we just need to review our training data (50-75 articles per quality class) to fix the labels and retrain.

Jul 30 2021, 5:10 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

Jul 29 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

We're unblocked with new work. We have new code ready for modeling/testing that improved unsourced content detection.

Jul 29 2021, 6:04 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

Jul 20 2021

Halfak added a comment to T287021: Move CJK segmentation features to a branch and revert revscoring.

https://github.com/wikimedia/revscoring/pull/505

Jul 20 2021, 3:46 PM · Patch-For-Review, artificial-intelligence, revscoring, Machine-Learning-Team (Active Tasks)
Halfak created T287021: Move CJK segmentation features to a branch and revert revscoring.
Jul 20 2021, 3:46 PM · Patch-For-Review, artificial-intelligence, revscoring, Machine-Learning-Team (Active Tasks)

Jun 23 2021

Halfak added a comment to T284687: Resource allocation request for the wikicommunityhealth project.

I suggest referencing https://pythonhosted.org/mwxml/map.html#mwxml.map

Jun 23 2021, 6:15 PM · Cloud-VPS (Quota-requests)

Jun 8 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

I see. You're asking to include the "weighted sum" measure in the JSON output?

Jun 8 2021, 3:17 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
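
One common way to compute a "weighted sum" measure like the one asked about above is to weight each class probability by the class's rank. The sketch below is an assumption about what such a measure could look like, not the code ORES uses; the class order E < D < C < B < A and the function name are hypothetical.

```python
# Assumed class order from lowest to highest quality (hypothetical).
ORDERED_CLASSES = ["E", "D", "C", "B", "A"]


def weighted_sum(probability, ordered_classes=ORDERED_CLASSES):
    """Sum of rank * probability, with rank 0 for the lowest class."""
    return sum(
        rank * probability.get(label, 0.0)
        for rank, label in enumerate(ordered_classes)
    )


example = {"E": 0.1, "D": 0.2, "C": 0.4, "B": 0.2, "A": 0.1}
value = weighted_sum(example)
```

With five classes the result falls between 0 (certainly the lowest class) and 4 (certainly the highest), which gives a single continuous number alongside the predicted class.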
Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

The output of https://ores.wikimedia.org/v3/scores/nlwiki/123125/articlequality is pure JSON and links are not possible in this data format.

Jun 8 2021, 3:00 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
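
Because the response is plain JSON, any links have to be built by whatever consumes it. A minimal sketch of reading one score follows. The payload is illustrative: its nesting (wiki, then "scores", then revision ID, then model, then "score") and its numbers are assumptions, not output copied from the live service, and extract_score is a hypothetical helper.

```python
import json

# Illustrative payload only: the nesting and the numbers are assumptions,
# not output copied from the live service.
SAMPLE_RESPONSE = json.dumps({
    "nlwiki": {
        "scores": {
            "123125": {
                "articlequality": {
                    "score": {
                        "prediction": "C",
                        "probability": {
                            "A": 0.05, "B": 0.15, "C": 0.5, "D": 0.2, "E": 0.1
                        },
                    }
                }
            }
        }
    }
})


def extract_score(raw_json, wiki, rev_id, model):
    """Return the score dict for one revision and model, or None if absent."""
    payload = json.loads(raw_json)
    try:
        return payload[wiki]["scores"][str(rev_id)][model]["score"]
    except KeyError:
        return None


score = extract_score(SAMPLE_RESPONSE, "nlwiki", 123125, "articlequality")
```

A user script could take the prediction from this dict and render it with a local template or link, which keeps presentation out of the API response.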

Jun 3 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

Here's the importance table. The higher the importance score, the more important the value is to the prediction. It turns out that the count of category links is the least important feature of the set. Overall length of the article, the amount of content with references, and the proportion of content that is referenced are the dominant features.

Jun 3 2021, 2:45 AM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

Jun 2 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

Sorry for the late response. The holiday weekend in the US (Memorial Day) had me out of my usual flow.

Jun 2 2021, 3:59 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

May 24 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

Yeah! We can exclude the Main page. I wonder if there is a good way to identify if we're loading the script on the main page in a wiki/language independent way. In the meantime, I'll look into making a special case for nlwiki.

May 24 2021, 12:06 AM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

May 21 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

I've added the class template to the user script code. The output of the script (at the top of the page) looks right now.

May 21 2021, 8:33 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

May 19 2021

Halfak added a comment to T251571: Build article quality model for Ukrainian Wikipedia.

Confirmed. https://ores.wikimedia.org/v3/scores/ukwiki shows articlequality version 0.8.0.

May 19 2021, 5:51 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, articlequality-modeling, Wikilabels

May 17 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

I updated the language for the tool so that it should show the Dutch version now.

May 17 2021, 10:33 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
Halfak added a comment to T130273: Complete viwiki edit quality campaign.

Nice work! There should not have been bot edits in the dataset. But this dataset is very old, so I wonder if that is why some snuck in. I wonder if maybe some of those bots aren't using the bot flag? We usually don't filter by username when generating the dataset, but that's an option.

May 17 2021, 7:21 PM · artificial-intelligence, Machine-Learning-Team, editquality-modeling, Wikilabels
Halfak added a comment to T257359: Update Turkish Wikipedia's labeling campaign for 2020.

It does!

May 17 2021, 7:01 PM · Patch-For-Review, Turkish-Sites, artificial-intelligence, editquality-modeling, Machine-Learning-Team
Halfak added a comment to T257359: Update Turkish Wikipedia's labeling campaign for 2020.

I just had a chance to take a look at this. Sorry I was AFK this weekend. But all looks good to me.

May 17 2021, 4:07 PM · Patch-For-Review, Turkish-Sites, artificial-intelligence, editquality-modeling, Machine-Learning-Team

May 13 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

Nice work! Thanks folks.

May 13 2021, 5:10 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

May 11 2021

Halfak added a comment to T257359: Update Turkish Wikipedia's labeling campaign for 2020.

Looks good. I made a parallel PR to fix a weird travis issue that was preventing your tests from passing. If you merge https://github.com/wikimedia/editquality/pull/234 and rebase, I think the tests will pass then.

May 11 2021, 7:52 PM · Patch-For-Review, Turkish-Sites, artificial-intelligence, editquality-modeling, Machine-Learning-Team
Halfak added a comment to T278723: ORES deployment - Spring 2021.

The fix is merged. Because of T212818, you'll need to manually propagate the changes to gerrit for the drafttopic repo before updating the deploy repo and re-deploying to Beta.

May 11 2021, 3:45 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

May 10 2021

Halfak added a comment to T223782: Build article quality model for Dutch Wikipedia.

Sure! We can even use local templates. Would you be interested in creating templates with badges/colors you like for the prediction that appears on the top of the page?

May 10 2021, 5:14 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team

May 6 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

@kevinbazira, it would be great if you could review for me.

May 6 2021, 3:12 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

May 5 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

Found a few minutes. Rebuild in progress.

May 5 2021, 5:19 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Sorry I missed one of your other questions.

May 5 2021, 4:32 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

The pipelines are documented/automated in the relevant Makefiles. E.g. if you install the dependencies for https://github.com/wikimedia/drafttopic, delete the old viwiki models and run make models it should rebuild the relevant models.

May 5 2021, 4:05 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

I'll try to find some time this evening to rebuild the viwiki model with the right version of sklearn.

May 5 2021, 3:57 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Aha! Looks like https://ores-beta.wmflabs.org/v3/scores/viwiki/123125/articletopic raises the error.

May 5 2021, 3:55 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Can we figure out what request caused this error?

May 5 2021, 3:53 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

May 3 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

Here's the initial draft of that work: https://meta.wikimedia.org/wiki/Research:On_the_collaboration_with_Wikimedia_Communities_in_the_context_of_building_Machine_Learning_Systems

May 3 2021, 4:05 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak updated subscribers of T278723: ORES deployment - Spring 2021.

I totally agree about big deployments! It's been too long.

May 3 2021, 4:04 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 30 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

Looks like I was mistaken while I was reading the graphs. The change is from 27% to 22% of available memory. So no substantial change, which is a minor surprise but not a big one, given that we are adding 7 new models while reducing memory consumption from word vector embeddings.

Apr 30 2021, 12:04 AM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 29 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

I added some details about the nature of the deployment to the task description. The main concern is changes in memory usage. We saw a drop in memory usage on Beta as expected. It appears that we see a drop in memory usage from 40% to 22% on web nodes.

Apr 29 2021, 11:34 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak updated the task description for T278723: ORES deployment - Spring 2021.
Apr 29 2021, 11:02 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

@kevinbazira, it looks like we're ready to proceed with a production deployment. I'll drop you an email about scheduling.

Apr 29 2021, 6:09 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

I filed T281495: Restructure ORES labs redis puppet role because that seems like it is solvable and might be part of the ORES test environment migration to the MWCS project.

Apr 29 2021, 3:40 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak created T281495: Restructure ORES labs redis puppet role.
Apr 29 2021, 3:38 PM · Infrastructure-Foundations, Puppet, Machine-Learning-Team, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Confirmed! https://ores-beta.wmflabs.org/v3/scores/nlwiki/12345678/articlequality works.

Apr 29 2021, 3:36 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 28 2021

Halfak added a comment to T257359: Update Turkish Wikipedia's labeling campaign for 2020.

I left some comments on the PR. There's just some practical data-work to do. I hope my instructions are clear enough. There's also a weird travis failure that I hope to look into. It will require running python 3.5 locally. I believe python 3.5 is the default python3 on ores-misc-01 so that should make things easy.

Apr 28 2021, 11:22 PM · Patch-For-Review, Turkish-Sites, artificial-intelligence, editquality-modeling, Machine-Learning-Team
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Aha! I misunderstood. I'm seeing the auth error re-appear https://ores-beta.wmflabs.org/v3/scores/enwiki/

Apr 28 2021, 5:14 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Note that we have redis servers on port 6380 and 6379 so the password will need to be set for both.

Apr 28 2021, 4:40 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Thanks for your work @elukey!

Apr 28 2021, 4:40 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 27 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

For reference, this is where ORES references Envoy (localhost:6500).

Apr 27 2021, 4:38 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

@Majavah after some thought, I think it would be great if you would look into Envoy for Beta. Honestly, I am asking this because it makes my work easier. But I also figure that anything else that is using Envoy in production would probably want to also use Envoy in Beta.

Apr 27 2021, 4:01 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 26 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

https://wikitech.wikimedia.org/wiki/Ores/Deployment is the key reference

Apr 26 2021, 6:00 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

We do our production test deployments in deployment-prep (ores-beta.wmflabs.org). ores.wmflabs.org is for experimental model deployments. E.g. scap is set up to work in beta. For ores.wmflabs.org we use fabric to deploy because scap is unavailable.

Apr 26 2021, 3:40 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 24 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

Thanks for the consideration Majavah. Right now, I don't see a good option.

Apr 24 2021, 7:23 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Here's the most relevant patchset. https://gerrit.wikimedia.org/r/c/mediawiki/services/ores/deploy/+/621522

Apr 24 2021, 6:40 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak updated subscribers of T278723: ORES deployment - Spring 2021.

I found https://wikitech.wikimedia.org/wiki/Envoy.

Apr 24 2021, 6:09 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Looks like the redis-cli ignores passwords if they aren't required. I tried setting the password requirement by running CONFIG SET requirepass "<thepassword>" in the terminal and that seemed to work.

Apr 24 2021, 6:07 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

It looks like we do use a password to connect and when I use that password on deployment-ores01, it connects to redis just fine.

Apr 24 2021, 5:29 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

We got a connection error when trying to talk to redis.

Apr 24 2021, 5:24 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T278723: ORES deployment - Spring 2021.

OK I've made the updates and we're ready for a new deployment to Beta, but I'm still blocked on being able to run scap myself.

Apr 24 2021, 4:36 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 23 2021

Halfak created T280998: Scap deploy for ORES reports success even when uwsgi fails to start up.
Apr 23 2021, 6:47 PM · Scap, ORES, Machine-Learning-Team

Apr 22 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

I have now. I think the non-updating submodules were a red herring. I see now that the code and assets filenames were not aligned. I've got a change in progress that should resolve T280420.

Apr 22 2021, 9:30 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a comment to T280420: ores-beta.wmflabs.org is unreachable.

I confirmed that some code was not updated for these models and that is causing the issue. I have a change in progress that should resolve the issue. I'd like to keep this task open until we can get ores-beta back online.

Apr 22 2021, 9:29 PM · Beta-Cluster-Infrastructure, Machine-Learning-Team
Halfak added a comment to T280420: ores-beta.wmflabs.org is unreachable.

Aha! It does seem like there is a mismatch here. I'm not sure why it appears that the submodules are not being updated. That might be a red herring. This code and these assets should be in alignment and they are not. I'll go digging. Thanks @elukey

Apr 22 2021, 9:17 PM · Beta-Cluster-Infrastructure, Machine-Learning-Team
Halfak added a comment to T278723: ORES deployment - Spring 2021.

It looks like @elukey's deployment run failed to update the submodules on the deployment host (deployment-ores01). Here's what I see on the deployment host:

Apr 22 2021, 4:43 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 16 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

Looks like we can't reach ores-beta.wmflabs.org. I created a task for exploring it: T280420: ores-beta.wmflabs.org is unreachable

Apr 16 2021, 11:12 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak created T280420: ores-beta.wmflabs.org is unreachable.
Apr 16 2021, 11:10 PM · Beta-Cluster-Infrastructure, Machine-Learning-Team

Apr 13 2021

Halfak added a comment to T257359: Update Turkish Wikipedia's labeling campaign for 2020.

This change will look a lot like this work for ptwiki: https://github.com/wikimedia/editquality/pull/225/files

Apr 13 2021, 5:57 PM · Patch-For-Review, Turkish-Sites, artificial-intelligence, editquality-modeling, Machine-Learning-Team
Halfak added a comment to T278723: ORES deployment - Spring 2021.

Deploy failed with the following error:

Apr 13 2021, 4:59 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 8 2021

Halfak added a comment to T278723: ORES deployment - Spring 2021.

Thank you! Will run a test on beta when I get a chance and report back here.

Apr 8 2021, 6:47 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Apr 1 2021

Halfak added a comment to T277609: Generate dump of scored-revisions from 2018-2020 for English Wikipedia.

I wonder if this is related to: T104004: Can't download large datasets from datasets.wikimedia.org

Apr 1 2021, 5:14 PM · Analytics-Kanban, Data-Services, artificial-intelligence, editquality-modeling, ORES, Analytics, Machine-Learning-Team

Mar 30 2021

Halfak added a comment to T257359: Update Turkish Wikipedia's labeling campaign for 2020.

I've finally got the deployment of ORES unblocked. That was a surprisingly large amount of work to get things cleaned up. We're now blocked on getting this to production before we can get retrained Turkish models out. See T278723: ORES deployment - Spring 2021.

Mar 30 2021, 4:49 PM · Patch-For-Review, Turkish-Sites, artificial-intelligence, editquality-modeling, Machine-Learning-Team
Halfak moved T278723: ORES deployment - Spring 2021 from Non-Project Work to Review on the Machine-Learning-Team (Active Tasks) board.
Mar 30 2021, 4:36 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak claimed T278723: ORES deployment - Spring 2021.
Mar 30 2021, 4:01 PM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak removed a parent task for T246909: Follow-up cleanup to topic models: T278723: ORES deployment - Spring 2021.
Mar 30 2021, 12:58 AM · drafttopic-modeling, Machine-Learning-Team
Halfak removed a subtask for T278723: ORES deployment - Spring 2021: T246909: Follow-up cleanup to topic models.
Mar 30 2021, 12:58 AM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a parent task for T249382: Scale: ORES topic models for uk, hu, hy, eu, sr (needed as soon as available): T278723: ORES deployment - Spring 2021.
Mar 30 2021, 12:57 AM · Machine-Learning-Team (Active Tasks), Serbian-Sites, Growth-Scaling, Growth-Team
Halfak added a subtask for T278723: ORES deployment - Spring 2021: T249382: Scale: ORES topic models for uk, hu, hy, eu, sr (needed as soon as available).
Mar 30 2021, 12:57 AM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak added a parent task for T223782: Build article quality model for Dutch Wikipedia: T278723: ORES deployment - Spring 2021.
Mar 30 2021, 12:56 AM · artificial-intelligence, articlequality-modeling, Wikilabels, Machine-Learning-Team
Halfak added a parent task for T246909: Follow-up cleanup to topic models: T278723: ORES deployment - Spring 2021.
Mar 30 2021, 12:56 AM · drafttopic-modeling, Machine-Learning-Team
Halfak added a parent task for T249520: Fit more topic models into ORES: T278723: ORES deployment - Spring 2021.
Mar 30 2021, 12:56 AM · drafttopic-modeling, Machine-Learning-Team
Halfak added subtasks for T278723: ORES deployment - Spring 2021: T223782: Build article quality model for Dutch Wikipedia, T249520: Fit more topic models into ORES, T246909: Follow-up cleanup to topic models.
Mar 30 2021, 12:56 AM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES
Halfak created T278723: ORES deployment - Spring 2021.
Mar 30 2021, 12:53 AM · Patch-For-Review, Machine-Learning-Team, artificial-intelligence, drafttopic-modeling, articlequality-modeling, ORES

Mar 29 2021

Halfak committed rOWCd3d777592e5c: Handles flake8 issue with nlwiki features. (authored by Halfak).
Handles flake8 issue with nlwiki features.
Mar 29 2021, 12:45 AM
Halfak committed rOWCf8efe0f07755: Update nlwiki.py (authored by Psingh07).
Update nlwiki.py
Mar 29 2021, 12:45 AM
Halfak committed rOWC9ab054bb5311: nlwiki update (authored by Cdrpar07 <shgcdr07@ores-misc-01.ores-staging.eqiad1.wikimedia.cloud>).
nlwiki update
Mar 29 2021, 12:45 AM
Halfak committed rOWC1b55cc1e5210: Fixes template names for dutch citation needed (authored by Aaron Halfaker <ahalfaker@wikimedia.org>).
Fixes template names for dutch citation needed
Mar 29 2021, 12:45 AM
Halfak committed rOWC0d73caf59808: Adds nlwiki model with basic features. (authored by Aaron Halfaker <ahalfaker@wikimedia.org>).
Adds nlwiki model with basic features.
Mar 29 2021, 12:45 AM

Mar 26 2021

Halfak committed R2300:08b9cebc5e01: Adds vectors for eu, hy, hu, sr, uk, and wikidatawiki (authored by Halfak).
Adds vectors for eu, hy, hu, sr, uk, and wikidatawiki
Mar 26 2021, 9:41 AM

Mar 25 2021

Halfak added a hashtag to Machine-Learning-Team: #scoring-platform-team.
Mar 25 2021, 5:15 PM
Halfak added a hashtag to Machine-Learning-Team: #scoring_platform_team.
Mar 25 2021, 5:14 PM

Mar 16 2021

Halfak updated the task description for T277609: Generate dump of scored-revisions from 2018-2020 for English Wikipedia.
Mar 16 2021, 10:13 PM · Analytics-Kanban, Data-Services, artificial-intelligence, editquality-modeling, ORES, Analytics, Machine-Learning-Team
Halfak updated the task description for T277609: Generate dump of scored-revisions from 2018-2020 for English Wikipedia.
Mar 16 2021, 10:13 PM · Analytics-Kanban, Data-Services, artificial-intelligence, editquality-modeling, ORES, Analytics, Machine-Learning-Team
Halfak created T277609: Generate dump of scored-revisions from 2018-2020 for English Wikipedia.
Mar 16 2021, 10:13 PM · Analytics-Kanban, Data-Services, artificial-intelligence, editquality-modeling, ORES, Analytics, Machine-Learning-Team

Mar 5 2021

Halfak added a comment to T276598: Create Draft Model Deployment Guidelines.

Currently there is no set of policies in place that candidate models (internal and external) must meet in order to be deployed. This is highly problematic.

Mar 5 2021, 11:25 PM · ML-Governance, ORES, Lift-Wing, artificial-intelligence, Machine-Learning-Team (Active Tasks)

Feb 12 2021

Halfak added a comment to T135908: Add a possibility to delete a draft.

FWIW, I think there's a big difference between "delete" and "archive". Delete breaks links and hides past activity. Archive gets stuff I don't want to see out of the way. I think "archive" is the right metaphor here. I would hate it if someone could no longer download the results of a query because some user decided to delete it.

Feb 12 2021, 5:37 PM · Quarry

Feb 10 2021

Halfak changed the status of T117802: WikiData model: Unsupported operand type(s) for /: 'NoneType' and 'float' from Declined to Resolved.

Looks like the issue was actually resolved. I don't see this error in production anymore.

Feb 10 2021, 5:01 PM · wb_vandalism

Feb 2 2021

Halfak added a comment to T257359: Update Turkish Wikipedia's labeling campaign for 2020.

Fantastic! I can work with the data you have provided to update the model. I'll try to get that work in soon but as I'm just a volunteer with a new baby, I can't give you any guarantees on when I'll be able to get to it. But a week or two seems likely at this point.

Feb 2 2021, 10:37 PM · Patch-For-Review, Turkish-Sites, artificial-intelligence, editquality-modeling, Machine-Learning-Team
Fae awarded T214201: Implement NSFW image classifier using Open NSFW a Dislike token.
Feb 2 2021, 5:26 PM · Structured-Data-Backlog, artificial-intelligence