Halfak (Aaron Halfaker, EpochFail, halfak)
Principal Research Scientist

Projects (17)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Oct 21 2014, 6:05 PM (204 w, 2 d)
Availability
Available
IRC Nick
halfak
LDAP User
Halfak
MediaWiki User
EpochFail [ Global Accounts ]

Hi! I'm a socio-technologist. I do science so that I can build new technologies for social systems.

You can find me as:

Recent Activity

Wed, Sep 19

Krinkle awarded T201691: Implement JS ORES client in mw-ORES extension a Orange Medal token.
Wed, Sep 19, 6:21 AM · Patch-For-Review, User-Ladsgroup, Scoring-platform-team (Current), MediaWiki-extensions-ORES, Gadgets

Fri, Sep 14

Halfak updated the task description for T204250: Document JADE schema proposals and justifications.
Fri, Sep 14, 3:17 PM · Documentation, Scoring-platform-team (Current)
Halfak added a comment to T204250: Document JADE schema proposals and justifications.

I'm not interested in re-visiting history. I think it's valuable to include all proposals in one space. There are some proposals that there might be consensus against, but I don't think that has been filed. E.g. it seems to me that "single judgment" is still an option. It doesn't serve all of our use-cases but it is *far simpler* and thus an interesting proposal from a product perspective.

Fri, Sep 14, 3:17 PM · Documentation, Scoring-platform-team (Current)
Halfak renamed T204250: Document JADE schema proposals and justifications from Document the history of JADE schema proposals and justifications to Document JADE schema proposals and justifications.
Fri, Sep 14, 2:52 PM · Documentation, Scoring-platform-team (Current)
Halfak awarded T198707: Hashtag search tools is down a Like token.
Fri, Sep 14, 2:04 PM · Hashtags

Thu, Sep 13

Halfak updated the task description for T204250: Document JADE schema proposals and justifications.
Thu, Sep 13, 8:52 PM · Documentation, Scoring-platform-team (Current)
Halfak updated the task description for T204250: Document JADE schema proposals and justifications.
Thu, Sep 13, 8:52 PM · Documentation, Scoring-platform-team (Current)
Halfak claimed T204250: Document JADE schema proposals and justifications.
Thu, Sep 13, 8:50 PM · Documentation, Scoring-platform-team (Current)
Halfak created T204250: Document JADE schema proposals and justifications.
Thu, Sep 13, 3:12 PM · Documentation, Scoring-platform-team (Current)

Wed, Sep 12

Halfak added a comment to T202596: Write our anticipated "phase two" schemas and submit for review.

The main thing driving us to that conclusion was that a "notes" field should be shared between damaging and goodfaith judgments, which is satisfied by this MCR design.

Wed, Sep 12, 8:40 PM · Patch-For-Review, DBA, Scoring-platform-team (Current), User-Joe, Operations, JADE
Halfak added a comment to T201146: Build article quality model for Galician Wikipedia.

How would you say "Assess article quality" in Galician? I'll use that to name the "labeling campaign" in Wiki labels.

Wed, Sep 12, 8:34 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Scoring-platform-team
Halfak edited projects for T201657: Make ORES dependency solving upstreamable, added: Scoring-platform-team; removed Scoring-platform-team (Current).
Wed, Sep 12, 4:50 PM · Scoring-platform-team, ORES, Upstream
Halfak added a comment to T201887: JADE literature review.

@awight said he had a bunch more stuff to paste in.

Wed, Sep 12, 4:49 PM · Scoring-platform-team (Current), JADE
Halfak created T204134: Workshop proposal for CSCW (JADE, ORES, etc.).
Wed, Sep 12, 4:49 PM · Documentation, Scoring-platform-team (Current)
Halfak added a comment to T201825: Test poolcounter support for ores in beta cluster.

Seems like this happened. Is it done?

Wed, Sep 12, 4:48 PM · User-Ladsgroup, Scoring-platform-team (Current), ORES
Halfak moved T203468: Update monthly article quality datasets from Current to Research & analysis on the Scoring-platform-team board.
Wed, Sep 12, 4:42 PM · Scoring-platform-team, artificial-intelligence, articlequality-modeling
Halfak edited projects for T203468: Update monthly article quality datasets, added: Scoring-platform-team (Current); removed Scoring-platform-team.
Wed, Sep 12, 4:40 PM · Scoring-platform-team, artificial-intelligence, articlequality-modeling
Halfak claimed T203468: Update monthly article quality datasets.
Wed, Sep 12, 4:39 PM · Scoring-platform-team, artificial-intelligence, articlequality-modeling
Halfak added a comment to T201146: Build article quality model for Galician Wikipedia.

I can get the GA and FA articles for model-training with this query: https://quarry.wmflabs.org/query/29680

Wed, Sep 12, 3:53 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Scoring-platform-team
Halfak updated the task description for T201142: Add language support for galician.
Wed, Sep 12, 3:07 PM · artificial-intelligence, Bad-Words-Detection-System, revscoring, Scoring-platform-team
Halfak added a comment to T202596: Write our anticipated "phase two" schemas and submit for review.

For clarity, here's a rough version of the endorsements proposal that I'd originally put together about a year ago(from schema etherpad and jade repo ):

Wed, Sep 12, 2:34 PM · Patch-For-Review, DBA, Scoring-platform-team (Current), User-Joe, Operations, JADE
Halfak added a comment to T202596: Write our anticipated "phase two" schemas and submit for review.

With the wikitext slot, we won't know which note relates to which judgement. This is like having one big "notes" field for every entity.

Wed, Sep 12, 2:09 PM · Patch-For-Review, DBA, Scoring-platform-team (Current), User-Joe, Operations, JADE

Tue, Sep 11

Halfak added a comment to T201146: Build article quality model for Galician Wikipedia.

I think the best place to do that would be to simply describe a labeling scale in a similar way to what you see at https://en.wikipedia.org/wiki/Wikipedia:Content_assessment. @Theklan, do you have such a documentation page describing the scale on euwiki?

Tue, Sep 11, 7:27 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Scoring-platform-team
Halfak added a comment to T164994: Enable wbgetsuggestions API to get recommended properties even if they have existed in an item.

@hoo, any updates? Seems like this task has been stagnant for a few weeks.

Tue, Sep 11, 6:37 PM · MediaWiki-extensions-PropertySuggester, Patch-For-Review, MediaWiki-API, Wikidata
Halfak updated subscribers of T183200: Blog about JADE.

I resolved everything except something that needs review from @DannyH. I think that's our final blocker.

Tue, Sep 11, 3:51 PM · Community-Tech, Scoring-platform-team (Current), WMF-Communications, JADE
Halfak moved T183200: Blog about JADE from Active to Review on the Scoring-platform-team (Current) board.
Tue, Sep 11, 3:51 PM · Community-Tech, Scoring-platform-team (Current), WMF-Communications, JADE

Mon, Sep 10

Halfak added a comment to T152012: Silence or address E_WOULDBLOCK warning.

I don't see how pool counter would help us sequence our tasks in celery. What's your proposal?

Mon, Sep 10, 2:03 PM · ORES, Scoring-platform-team

Fri, Sep 7

Halfak awarded Blog Post: Additional details on OurMine a Like token.
Fri, Sep 7, 6:51 PM · Security-Team
Halfak added a comment to Blog Post: Additional details on OurMine.

Was 2FA for admins part of this response?

Fri, Sep 7, 6:51 PM · Security-Team
Halfak awarded Blog Post: Details of dictionary attack from May 2018 a Like token.
Fri, Sep 7, 6:49 PM · Security-Team

Thu, Sep 6

Halfak committed rOEQ17ac8bd6fc2f: Handle deleted users in self-revert check. (authored by Halfak).
Handle deleted users in self-revert check.
Thu, Sep 6, 1:24 PM
Halfak committed rOEQd048d9c07dda: Adds Makefile rules for simplewiki reverted model. (authored by Halfak).
Adds Makefile rules for simplewiki reverted model.
Thu, Sep 6, 1:24 PM
Aaron Halfaker <ahalfaker@wikimedia.org> committed rOEQ799adeb7d6b6: Adds human_labeled and merged samples to huwiki.yaml (authored by Halfak).
Adds human_labeled and merged samples to huwiki.yaml
Thu, Sep 6, 1:24 PM
Halfak committed rOEQ6b9207784fab: Makes Makefile template more modular. (authored by Halfak).
Makes Makefile template more modular.
Thu, Sep 6, 1:24 PM
Halfak committed rOWC56f6157fd97b: More flexible score extractor weights. (authored by Halfak).
More flexible score extractor weights.
Thu, Sep 6, 12:50 PM

Wed, Sep 5

Halfak closed T203610: Fix diffing issue (mistaken added text) as Resolved.
Wed, Sep 5, 9:48 PM · Scoring-platform-team (Current)
Halfak added a comment to T203610: Fix diffing issue (mistaken added text).

Got it. I released deltas 0.4.7.

Wed, Sep 5, 9:48 PM · Scoring-platform-team (Current)
Halfak claimed T203610: Fix diffing issue (mistaken added text).
Wed, Sep 5, 9:38 PM · Scoring-platform-team (Current)
Halfak created T203610: Fix diffing issue (mistaken added text).
Wed, Sep 5, 9:38 PM · Scoring-platform-team (Current)
GitHub <noreply@github.com> committed rORESf8f8e45c10f9: Merge pull request #261 from wikimedia/timing_poolcounter (authored by Halfak).
Merge pull request #261 from wikimedia/timing_poolcounter
Wed, Sep 5, 8:06 PM

Tue, Sep 4

Halfak moved T186749: Train and test edit type model for Catalan Wikipedia from Community engagement to New development on the Scoring-platform-team board.
Tue, Sep 4, 3:11 PM · Scoring-platform-team, edittypes-modeling, artificial-intelligence, ORES
Halfak created T203468: Update monthly article quality datasets.
Tue, Sep 4, 3:11 PM · Scoring-platform-team, artificial-intelligence, articlequality-modeling
Halfak updated subscribers of T198982: New Pages Feed: backfill ORES scores for unscored drafts (3.3).

Please let us know via the #wikimedia-ai channel when you run this so that we can monitor the load it puts on ORES. I don't expect a hiccup especially since I expect many of these scores to be pre-cached, but its good to be present and observing a big job like this. :)

Tue, Sep 4, 2:46 PM · MW-1.32-release-notes (WMF-deploy-2018-08-07 (1.32.0-wmf.16)), Patch-For-Review, Growth-Team (Current Sprint), English-Wikipedia-New-Pages-Patrol
Halfak added a comment to T202348: Resume refinement of edit events in Data Lake.

FYI: Here's the proposal I wrote a while ago: https://meta.wikimedia.org/wiki/Schema_talk:Edit#A_proposal_I_wrote_a_while_ago

Tue, Sep 4, 2:23 PM · Analytics-Data-Quality, Analytics, Contributors-Analysis, Product-Analytics
Halfak added a comment to T201868: Merge ORES precaching with ORESFetchScoreJob.

Keying on page title doesn't work because we store scores for revisions historically. Thus revision IDs are necessary. Also, it is important to note that page titles are not a durable identifier. It's not uncommon to rename pages.

Tue, Sep 4, 2:13 PM · ORES, Services (designing), Scoring-platform-team
Halfak added a comment to T200297: Introduce a new namespace for collaborative judgments about wiki entities.

I think that querying by within-judgement content should be very limited (and probably within the page_info table pending a discussion scaling issues there). Complex querying should instead be supported by analytics infrastructure and potentially search infrastructure. MediaWiki's databases are not well suited to analytic querying anyway. We're instead focused on loading streams of judgments into analytics HIVE and producing easy-to-consume database dumps.

Tue, Sep 4, 2:07 PM · Scoring-platform-team (Current), DBA, Operations, JADE, TechCom-RFC
Halfak added a comment to T201146: Build article quality model for Galician Wikipedia.

Ahh yes. It looks like we'll need to sample and generate a labeled dataset like we did for euwiki.

Tue, Sep 4, 1:51 PM · artificial-intelligence, articlequality-modeling, Wikilabels, Scoring-platform-team
Halfak updated subscribers of T201142: Add language support for galician.

@Ladsgroup, can you run the BWDS script on Galacian?

Tue, Sep 4, 1:43 PM · artificial-intelligence, Bad-Words-Detection-System, revscoring, Scoring-platform-team

Aug 20 2018

GitHub <noreply@github.com> committed rOEQfd3301e835d1: Merge pull request #169 from wiki-ai/T197679_new_feature (authored by Halfak).
Merge pull request #169 from wiki-ai/T197679_new_feature
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ2cfd4766482e: Merge pull request #168 from wiki-ai/more_wikidata_features (authored by Halfak).
Merge pull request #168 from wiki-ai/more_wikidata_features
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ80d0282d0214: Merge pull request #166 from wiki-ai/wikidata_second_campaign (authored by Halfak).
Merge pull request #166 from wiki-ai/wikidata_second_campaign
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ6fd33af72975: Merge pull request #164 from wiki-ai/wikidata_second_campaign (authored by Halfak).
Merge pull request #164 from wiki-ai/wikidata_second_campaign
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ3bac2ca8597b: Merge pull request #163 from wiki-ai/dedupe_name (authored by Halfak).
Merge pull request #163 from wiki-ai/dedupe_name
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ320eca8a6d45: Merge pull request #162 from wiki-ai/T196468 (authored by Halfak).
Merge pull request #162 from wiki-ai/T196468
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ971486a6b1de: Merge pull request #160 from wiki-ai/srwiki_damaging (authored by Halfak).
Merge pull request #160 from wiki-ai/srwiki_damaging
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ27e14a1c15ce: Bumps mwreverts requirement (authored by Halfak).
Bumps mwreverts requirement
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ8cf0f74acfed: Merge pull request #153 from wiki-ai/huwiki_merge (authored by Halfak).
Merge pull request #153 from wiki-ai/huwiki_merge
Aug 20 2018, 7:39 PM
Halfak committed rOEQd7ebbbc705fa: Rename model_info directory (#150) (authored by awight).
Rename model_info directory (#150)
Aug 20 2018, 7:39 PM
Halfak committed rOEQa44aa3613188: Simplify template (#144) (authored by awight).
Simplify template (#144)
Aug 20 2018, 7:39 PM
Halfak committed rOEQd54a2eb2149d: Dump a model_info file any time we rebuild a model (#146) (authored by awight).
Dump a model_info file any time we rebuild a model (#146)
Aug 20 2018, 7:39 PM
Halfak committed rOEQe7373b183c58: Isolate codegen (#145) (authored by awight).
Isolate codegen (#145)
Aug 20 2018, 7:39 PM
Halfak committed rOEQ69127fe39eb7: Adds updated enwiktionary model. (authored by Halfak).
Adds updated enwiktionary model.
Aug 20 2018, 7:39 PM
Halfak committed rOEQ363d2ee0b61f: Regenerates Makefile. (authored by Halfak).
Regenerates Makefile.
Aug 20 2018, 7:39 PM
GitHub <noreply@github.com> committed rOEQ3ea080d06314: Merge pull request #136 from wiki-ai/enwiktionary_templating (authored by Halfak).
Merge pull request #136 from wiki-ai/enwiktionary_templating
Aug 20 2018, 7:39 PM
Halfak committed rOEQ0c7c0b88a1ec: Turn English Wiktionary to unbalanced sample (authored by Ladsgroup).
Turn English Wiktionary to unbalanced sample
Aug 20 2018, 7:38 PM
Halfak committed rOEQb2f9b2c7aa6e: Enable downsizing review samples and apply it on enwiktionary (authored by Ladsgroup).
Enable downsizing review samples and apply it on enwiktionary
Aug 20 2018, 7:38 PM
Halfak committed rOEQ7759a3149118: Simplify enwiktionary to use 92k observations for the base dataset. (authored by Halfak).
Simplify enwiktionary to use 92k observations for the base dataset.
Aug 20 2018, 7:38 PM
Halfak committed rOEQ067e679b70aa: Template English Wiktionary (authored by Ladsgroup).
Template English Wiktionary
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ5734a7224371: Merge pull request #141 from wiki-ai/template_more (authored by Halfak).
Merge pull request #141 from wiki-ai/template_more
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQbbc4af7f30c0: Merge pull request #137 from wiki-ai/fix_quote (authored by Halfak).
Merge pull request #137 from wiki-ai/fix_quote
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQec30dfb7fd53: Merge pull request #133 from wiki-ai/fix_merge_attempt_two (authored by Halfak).
Merge pull request #133 from wiki-ai/fix_merge_attempt_two
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQf0c73450a004: Merge pull request #132 from wiki-ai/template_nowiki_viwiki (authored by Halfak).
Merge pull request #132 from wiki-ai/template_nowiki_viwiki
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQe2609fe72557: Merge pull request #124 from wiki-ai/make_automated (authored by Halfak).
Merge pull request #124 from wiki-ai/make_automated
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ3b2d92f67735: Merge pull request #123 from wiki-ai/makefile_cleanup (authored by Halfak).
Merge pull request #123 from wiki-ai/makefile_cleanup
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ16db8d8fa09c: Merge pull request #115 from wiki-ai/more_templating (authored by Halfak).
Merge pull request #115 from wiki-ai/more_templating
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ2747c84a96c8: Merge pull request #111 from wiki-ai/use_nltk_stopwords (authored by Halfak).
Merge pull request #111 from wiki-ai/use_nltk_stopwords
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ88566e225bda: Merge pull request #113 from wiki-ai/cawiki_reverted (authored by Halfak).
Merge pull request #113 from wiki-ai/cawiki_reverted
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQcc760a1fcbee: Merge pull request #108 from wiki-ai/eswikiquote (authored by Halfak).
Merge pull request #108 from wiki-ai/eswikiquote
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ19bbedb9cc07: Merge pull request #107 from wiki-ai/iswiki_reverted (authored by Halfak).
Merge pull request #107 from wiki-ai/iswiki_reverted
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQc20a589572c2: Merge pull request #106 from wiki-ai/wd_again (authored by Halfak).
Merge pull request #106 from wiki-ai/wd_again
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ214401574c90: Merge pull request #104 from wiki-ai/no_nose (authored by Halfak).
Merge pull request #104 from wiki-ai/no_nose
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ2b6581b901ba: Merge pull request #103 from wiki-ai/update_mwtypes (authored by Halfak).
Merge pull request #103 from wiki-ai/update_mwtypes
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ5fa12bb6c3c5: Merge pull request #102 from wiki-ai/srwiki_campaign (authored by Halfak).
Merge pull request #102 from wiki-ai/srwiki_campaign
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQeb7910f3d2f0: Merge pull request #100 from wiki-ai/eswikibooks (authored by Halfak).
Merge pull request #100 from wiki-ai/eswikibooks
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ49ba211f7c3d: Deletes eswikibooks.reverted model (authored by Halfak).
Deletes eswikibooks.reverted model
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ27f66e889f21: Merge pull request #95 from wiki-ai/hrwiki (authored by Halfak).
Merge pull request #95 from wiki-ai/hrwiki
Aug 20 2018, 7:38 PM
Halfak committed rOEQe52ab01cd479: Adds configurable number of extractors to Makefile. (authored by Halfak).
Adds configurable number of extractors to Makefile.
Aug 20 2018, 7:38 PM
Halfak committed rOEQ4b5cbff8bd74: Rebuilds models for revscoring 2.0.8 (authored by Halfak).
Rebuilds models for revscoring 2.0.8
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ7238337187b6: Merge pull request #96 from wiki-ai/eswiki_damaging (authored by Halfak).
Merge pull request #96 from wiki-ai/eswiki_damaging
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQ04c249593392: Merge pull request #98 from wiki-ai/clean_reverted (authored by Halfak).
Merge pull request #98 from wiki-ai/clean_reverted
Aug 20 2018, 7:38 PM
Halfak committed rOEQd224ea166f2b: Ad eswiki damaging and goodfaith (authored by Ladsgroup).
Ad eswiki damaging and goodfaith
Aug 20 2018, 7:38 PM
Halfak committed rOEQac18f353bd9c: Fixes tuning commands in makefile and regenerates tuning reports for all models. (authored by Halfak).
Fixes tuning commands in makefile and regenerates tuning reports for all models.
Aug 20 2018, 7:38 PM
Halfak committed rOEQ81a1ec67b4e2: Adds models and tuning reports for svwiki damaging/goodfaith (authored by Halfak).
Adds models and tuning reports for svwiki damaging/goodfaith
Aug 20 2018, 7:38 PM
Halfak committed rOEQ3d5e1145b40d: Adds some Makefile rules for svwiki damaging/goodfaith. (authored by Halfak).
Adds some Makefile rules for svwiki damaging/goodfaith.
Aug 20 2018, 7:38 PM
Halfak committed rOEQ74ba5034ff06: Adds join_observations utility. (authored by Halfak).
Adds join_observations utility.
Aug 20 2018, 7:38 PM
Halfak committed rOEQ192391004c9a: Updates tuning params for revscoring 2.0 (authored by Halfak).
Updates tuning params for revscoring 2.0
Aug 20 2018, 7:38 PM
GitHub <noreply@github.com> committed rOEQf19c4ec291a8: Merge pull request #91 from wiki-ai/fawiki_damaging_v2 (authored by Halfak).
Merge pull request #91 from wiki-ai/fawiki_damaging_v2
Aug 20 2018, 7:38 PM
Halfak committed rOEQ4fed9dc9d6a0: Updates models for revscoring 2.0 (authored by Halfak).
Updates models for revscoring 2.0
Aug 20 2018, 7:38 PM
Halfak committed rOEQ60e4e6ee0516: Minor fixes in Makefile (re. awight's review) (authored by Halfak).
Minor fixes in Makefile (re. awight's review)
Aug 20 2018, 7:38 PM
Halfak committed rOEQd1e24d9ef2ff: Adds sqwiki to default model build (authored by Halfak).
Adds sqwiki to default model build
Aug 20 2018, 7:38 PM