<feed xmlns="http://www.w3.org/2005/Atom"><title>Score all the things</title><id>https://phabricator.wikimedia.org/phame/blog/feed/8/</id><link rel="self" type="application/atom+xml" href="https://phabricator.wikimedia.org/phame/blog/feed/8/" /><updated>2023-04-13T21:52:02+00:00</updated><entry><title>Status Update (May 2, 2018)</title><link href="/phame/live/8/post/104/status_update_may_2_2018/" /><id>https://phabricator.wikimedia.org/phame/post/view/104/</id><author><name>awight (Adam Wight)</name></author><published>2018-05-02T20:05:02+00:00</published><updated>2018-05-02T20:39:54+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><h3 class="remarkup-header">Highlights</h3>

<ul class="remarkup-list">
<li class="remarkup-list-item">We&#039;ve started work on <a href="https://www.mediawiki.org/wiki/JADE" class="remarkup-link remarkup-link-ext" rel="noreferrer">JADE</a> in earnest, and the prototype is deployed to the beta cluster where it&#039;s available for testing and tool development.</li>
<li class="remarkup-list-item">Draft topic prerequisites are mostly falling into place, so we should be able to get the initial model deployed this month.</li>
<li class="remarkup-list-item">New, dynamic ORES support table shows up-to-date information about our progress for each wiki: <a href="https://tools.wmflabs.org/ores-support-checklist/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://tools.wmflabs.org/ores-support-checklist/</a></li>
<li class="remarkup-list-item">ORES is served from its own cluster, which gave us a tremendous benefit in both performance and stability.</li>
<li class="remarkup-list-item">More ORES support for Arabic, Bengali, Catalan, Hungarian, Latvian, Swedish Wikipedia</li>
</ul>

<h3 class="remarkup-header">Outreach</h3>

<p><a href="/T121719" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_0"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T121719: [Epic] Write paper about ORES as a socio-technical probe</span></span></a><br />
<a href="/T188123" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_1"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T188123: Present about draft topic model at Wikimedia Research Showcase.</span></span></a><br />
<a href="/T188124" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_2"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T188124: Build slide deck about AI at Wikimedia for Policy People</span></span></a><br />
<a href="/T190464" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_3"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T190464: Discuss surfacing ORES for AFC/NPP</span></span></a></p>

<h3 class="remarkup-header">Draft topic</h3>

<p><a href="/T123327" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_4"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T123327: Train/test draft topic model (new article routing AI)</span></span></a><br />
<a href="/T185147" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_5"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T185147: Host Google-News-word2vec.bin publicly</span></span></a><br />
<a href="/T185896" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_6"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T185896: OneVsRest Classification for revscoring</span></span></a><br />
<a href="/T188445" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_7"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T188445: Implement word2vec featurevector in revscoring</span></span></a><br />
<a href="/T189364" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_8"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T189364: Investigate word2vec memory issues with multiprocessing</span></span></a></p>

<h3 class="remarkup-header">Article quality</h3>

<p><a href="/T127470" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_9"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T127470: Deploy item quality classification model for Wikidata</span></span></a><br />
<a href="/T158437" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_10"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T158437: Change ORES rules to send all events to new &quot;/precache&quot; endpoint</span></span></a><br />
<a href="/T174684" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_11"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174684: Article quality campaign for Persian Wikipedia</span></span></a><br />
<a href="/T181522" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_12"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181522: Fix response processing logic in drafttopic.fetch_page_wikiprojects</span></span></a><br />
<a href="/T181820" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_13"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181820: Rename &quot;wikiclass&quot; to &quot;articlequality&quot;</span></span></a><br />
<a href="/T187927" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_14"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T187927: Drop &quot;non bot&quot; condition from ORES changeprop rules</span></span></a></p>

<h3 class="remarkup-header">Edit quality</h3>

<p><a href="/T130279" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_15"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T130279: Deploy ORES filters to Spanish Wikipedia</span></span></a><br />
<a href="/T131669" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_16"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T131669: Complete edit quality campaign for Arabic Wikipedia</span></span></a><br />
<a href="/T145394" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_17"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T145394: Deploy ORES filters in es.wikibooks</span></span></a><br />
<a href="/T163005" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_18"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163005: Complete Latvian Wikipedia editquality campaign</span></span></a><br />
<a href="/T163006" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_19"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163006: Train/test damaging &amp; goodfaith models for Latvian Wikipedia</span></span></a><br />
<a href="/T166235" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_20"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166235: Flagged revs approve model to fiwiki</span></span></a><br />
<a href="/T167968" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_21"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167968: Complete edit quality campaign for Hungarian Wikipedia</span></span></a><br />
<a href="/T174560" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_22"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174560: Enable ORES filters for svwiki</span></span></a><br />
<a href="/T174878" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_23"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174878: Edit quality campaign for Bengali Wikipedia</span></span></a><br />
<a href="/T182012" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_24"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182012: Deploy ORES filters to Simple Wikipedia</span></span></a><br />
<a href="/T185903" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_25"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T185903: Train/test damaging and goodfaith model for Hungarian Wikipedia</span></span></a><br />
<a href="/T186711" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_26"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T186711: Edit quality campaign for Bengali Wikisource</span></span></a><br />
<a href="/T187503" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_27"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T187503: Bengali Wikisource not recognized as a wikimedia wiki</span></span></a><br />
<a href="/T187732" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_28"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T187732: Train/test damaging/goodfaith model for Catalan Wikipedia</span></span></a><br />
<a href="/T187742" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_29"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T187742: I broke merge_labels utility so bad</span></span></a><br />
<a href="/T187771" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_30"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T187771: Edit quality campaign for Catalan Wikipedia</span></span></a><br />
<a href="/T187836" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_31"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T187836: [Epic] Audit of pending ORES GUI deployments</span></span></a><br />
<a href="/T188270" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_32"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T188270: Simplify hewiki, cswiki, plwiki, svwiki</span></span></a><br />
<a href="/T188535" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_33"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T188535: JSON error in building models</span></span></a><br />
<a href="/T188781" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_34"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T188781: Simplify azwiki&#039;s labeling campaign</span></span></a><br />
<a href="/T189710" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_35"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T189710: Train and test damaging/goodfaith model for arwiki</span></span></a><br />
<a href="/T192362" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_36"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T192362: Nulls appear in labeled data (merge_labels issue)</span></span></a></p>

<h3 class="remarkup-header">Judgment and dialogue engine</h3>

<p><a href="/T153152" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_37"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T153152: Design JADE data storage schema</span></span></a><br />
<a href="/T176333" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_38"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T176333: Deploy JADE prototype in Beta Cluster</span></span></a><br />
<a href="/T182055" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_39"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182055: Vagrant role for JADE</span></span></a><br />
<a href="/T187216" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_40"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T187216: Build prototype JADE extension</span></span></a><br />
<a href="/T189938" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_41"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T189938: Create mediawiki.org article for Extension:JADE</span></span></a></p>

<h3 class="remarkup-header">Infrastructure</h3>

<p><a href="/T168073" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_42"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168073: Switch ORES to dedicated cluster</span></span></a><br />
<a href="/T168455" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_43"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168455: [Epic] Implement code generation for model makefile maintenance</span></span></a><br />
<a href="/T169246" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_44"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169246: Stress/capacity test new ores* cluster</span></span></a><br />
<a href="/T171851" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_45"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T171851: Reimage ores* hosts with Debian Stretch</span></span></a><br />
<a href="/T180587" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_46"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T180587: Scream properly when threshold stats fetch is failing</span></span></a><br />
<a href="/T181071" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_47"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181071: Cache ORES virtualenv within versioned source</span></span></a><br />
<a href="/T181159" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_48"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181159: Migrate ORES extension threshold config from old to new syntax</span></span></a><br />
<a href="/T182799" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_49"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182799: Make sure ORES is compatible with stretch</span></span></a><br />
<a href="/T184072" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_50"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184072: Rebuild ORES models on Stretch</span></span></a><br />
<a href="/T184073" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_51"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184073: Provision a Stretch box we can use to build ORES models</span></span></a><br />
<a href="/T184074" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_52"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184074: Verify that all enchant/spelling dictionaries are available on Stretch.  Port if needed.</span></span></a><br />
<a href="/T184077" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_53"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184077: ORES MediaWiki-Vagrant roles should be ported to Stretch</span></span></a><br />
<a href="/T184135" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_54"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184135: Rebuild ORES wheels on Stretch</span></span></a><br />
<a href="/T184451" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_55"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184451: Selenium tests for ORES</span></span></a><br />
<a href="/T184938" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_56"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184938: Special:RecentChanges broken on Jenkins slaves</span></span></a><br />
<a href="/T185901" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_57"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T185901: Preliminary deployment of ORES to new cluster</span></span></a><br />
<a href="/T186453" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_58"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T186453: Fix or decide on edge cases of Makefile</span></span></a><br />
<a href="/T186557" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_59"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T186557: WikiLabels OAuth handshake doesn&#039;t work with HTTPS</span></span></a><br />
<a href="/T187914" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_60"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T187914: New precache endpoint isn&#039;t reporting its metrics correctly</span></span></a><br />
<a href="/T188777" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_61"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T188777: makefile generate makes impossible rules when no model is available</span></span></a><br />
<a href="/T189605" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_62"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T189605: Beta cluster ORES is emitting statsd errors</span></span></a><br />
<a href="/T189781" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_63"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T189781: Reboot oresrdb</span></span></a><br />
<a href="/T189948" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_64"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T189948: Clean up old backward compatibility settings of $wgOresModels</span></span></a><br />
<a href="/T189954" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_65"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T189954: Build ORES support checklist</span></span></a><br />
<a href="/T191321" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_66"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T191321: Remove deprecated hosts from ORES scap config</span></span></a><br />
<a href="/T191435" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_67"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T191435: Highlight wikis that has their campaign almost completed</span></span></a><br />
<a href="/T191808" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_68"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T191808: Fix the ORES-extension grafana dashboard</span></span></a><br />
<a href="/T192038" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_69"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T192038: Upgrade ORES mwparserfromhell dependency to 0.5.1</span></span></a></p>

<h3 class="remarkup-header">Revscoring</h3>

<p><a href="/T179874" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_70"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179874: Specter of revscoring 1 haunting ORES</span></span></a><br />
<a href="/T181163" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_71"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181163: Revscoring tune does not recognize a set of labels as target</span></span></a><br />
<a href="/T188755" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_72"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T188755: Update ORES requirements to support revscoring 2.2.0 </span></span></a><br />
<a href="/T192917" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_73"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T192917: Rebuild all models for revscoring 2.2.2</span></span></a></p>

<h3 class="remarkup-header">ORES extension</h3>

<p><a href="/T181334" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_74"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181334: Split Cache.php to different services</span></span></a><br />
<a href="/T181335" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_75"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181335: Redesign onRecentChange_save hook handler for ORES</span></span></a><br />
<a href="/T181892" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_76"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181892: Rewrite Stats.php</span></span></a><br />
<a href="/T183199" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_77"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183199: ORES Extension refactoring</span></span></a><br />
<a href="/T183266" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_78"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183266: wikidata.beta.wmflabs.org/wiki/Special:RecentChanges InvalidArgumentException  No model available for [goodfaith]</span></span></a><br />
<a href="/T185534" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_79"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T185534: Clean up ScoreLookup implementations</span></span></a></p></div></content></entry><entry><title>Status Update (January 30, 2018)</title><link href="/phame/live/8/post/84/status_update_january_30_2018/" /><id>https://phabricator.wikimedia.org/phame/post/view/84/</id><author><name>awight (Adam Wight)</name></author><published>2018-01-30T19:24:55+00:00</published><updated>2018-03-25T07:49:06+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><h3 class="remarkup-header">Highlights</h3>

<ul class="remarkup-list">
<li class="remarkup-list-item">Deployed Revscoring 2.0.  Each scoring model includes statistics that can be used to query and choose an appropriate threshold depending on the use case.</li>
<li class="remarkup-list-item">Rewrote ORES extension, improving code quality and test coverage.  Failures will cause graceful degradation rather than breaking pages that rely on ORES.</li>
<li class="remarkup-list-item">GCI happened and some work has been done on wikilabels.</li>
<li class="remarkup-list-item">The ORES labs cluster has been migrated to Debian Stretch, and we&#039;re ready to migrate production clusters.</li>
<li class="remarkup-list-item">&quot;draft topic&quot; model is trained and it works.  Support for the model in ORES is ongoing.</li>
<li class="remarkup-list-item">New languages, new campaigns, new models.  We&#039;ve deployed advanced edit quality models to Simple English, Spanish, and Swedish Wikipedia, Spanish Wikibooks, and basic edit quality to Icelandic Wikipedia and Spanish Wikiquote.  Preliminary edit quality campaigns are finished for Hungarian and Serbian Wikipedia.</li>
<li class="remarkup-list-item">JADE (auditing system) work is continuing, we have a database schema designed, some code written for the backend service, and have planned an event-based architecture plus content-handled Jade and Jade_talk namespaces within MediaWiki.</li>
<li class="remarkup-list-item">Draftquality data is cached in the ORES extension and is made available to other extensions.</li>
</ul>

<h3 class="remarkup-header">🤔</h3>

<p><a href="/T166045" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_80"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166045: Scoring platform team FY18 Q1</span></span></a><br />
<a href="/T178428" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_81"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178428: Respond to press inquiry re. algorithms &amp; bots in Wikipedia </span></span></a><br />
<a href="/T182823" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_82"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182823: Talk to reporter from OZY</span></span></a></p>

<h3 class="remarkup-header">Draft topic</h3>

<p><a href="/T172321" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_83"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T172321: Build mid-level WikiProject category training set</span></span></a><br />
<a href="/T179311" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_84"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179311: Generate mid-level WikiProject categories</span></span></a><br />
<a href="/T181166" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_85"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181166: Revscoring: Statistic for multilabel classification</span></span></a><br />
<a href="/T183580" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_86"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183580: class weights support for multilabel classification</span></span></a></p>

<h3 class="remarkup-header">JADE</h3>

<p><a href="/T170954" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_87"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T170954: Set up working group for JADE</span></span></a><br />
<a href="/T174685" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_88"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174685: Create list of ORES collaborators (focus on language asset helpers)</span></span></a><br />
<a href="/T175192" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_89"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175192: Design JADE scoring schema</span></span></a><br />
<a href="/T178101" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_90"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178101: Post about judgments/endorsements/preference</span></span></a><br />
<a href="/T178102" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_91"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178102: Post about suppression for JADE comments</span></span></a><br />
<a href="/T178103" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_92"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178103: Post about public analytics for JADE</span></span></a><br />
<a href="/T179298" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_93"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179298: Post about splitting &quot;revision&quot; and &quot;edit&quot; conceptually</span></span></a><br />
<a href="/T181098" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_94"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181098: Implement basic path structure for JADE (judgements)</span></span></a><br />
<a href="/T183598" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_95"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183598: Post about using MCR for JADE</span></span></a><br />
Potential event-passing design, <a href="https://www.mediawiki.org/wiki/File:JADE_event_schema.svg" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/File:JADE_event_schema.svg</a></p>

<h3 class="remarkup-header">Infrastructure</h3>

<p><a href="/T159105" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_96"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T159105: ORES services should have vagrant roles</span></span></a><br />
<a href="/T168672" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_97"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168672: Grant AWight admin access to ORES pypi repos</span></span></a><br />
<a href="/T173251" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_98"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T173251: Have CI merge research/ores/wheels changes</span></span></a><br />
<a href="/T174660" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_99"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174660: [Discuss] Moving from nosetests to pytest</span></span></a><br />
<a href="/T175651" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_100"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175651: Grafana has confusing or wrong scale for &quot;scores errored&quot; graph</span></span></a><br />
<a href="/T176914" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_101"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T176914: Wire statistics into test model included with our Vagrant role.</span></span></a><br />
<a href="/T179862" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_102"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179862: Keep statistics about ores service hits for storing thresholds</span></span></a><br />
<a href="/T181067" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_103"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181067: Parallelize scap for ORES</span></span></a><br />
<a href="/T181544" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_104"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181544: Investigate scb1001 and scb1002 available memory graphs in Grafana</span></span></a><br />
<a href="/T184077" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_105"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184077: ORES MediaWiki-Vagrant roles should be ported to Stretch</span></span></a></p>

<h3 class="remarkup-header">Wikilabels</h3>

<p><a href="/T139957" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_106"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T139957: Add &quot;info&quot; URL to campaign data so that we can link to campaign page</span></span></a><br />
<a href="/T155116" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_107"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T155116: Develop a backup strategy for campaigns/tasks/labels</span></span></a><br />
<a href="/T155440" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_108"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T155440: Add notice to on-wiki labeling pages (e.g. en:WP:Labels) about deprecation.</span></span></a><br />
<a href="/T171768" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_109"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T171768: Allow Wiki Labels API to list inactive campaigns</span></span></a><br />
<a href="/T175724" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_110"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175724: Oauth login does not return the user to the previous page</span></span></a><br />
<a href="/T175726" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_111"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175726: Error messages should not contain relative paths or error codes</span></span></a><br />
<a href="/T176331" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_112"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T176331: Deploy edit quality models for eswikibooks</span></span></a><br />
<a href="/T178004" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_113"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178004: Add list of labelers to campaign stats (sort by labels submitted)</span></span></a><br />
<a href="/T179015" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_114"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179015: Introduce and create pytest for flask application of the wikilabels AI service</span></span></a><br />
<a href="/T179296" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_115"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179296: Remove usages of nosetest and replace it with pytest</span></span></a><br />
<a href="/T183068" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_116"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183068: Wikilabel interface in Hungarian has uninformative action buttons, due to translations not updating</span></span></a><br />
<a href="/T183196" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_117"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183196: Deploy wikilabels mid-December 2017</span></span></a></p>

<h3 class="remarkup-header">ORES</h3>

<h4 class="remarkup-header">Migrating to stretch:</h4>

<p><a href="/T182799" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_118"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182799: Make sure ORES is compatible with stretch</span></span></a><br />
<a href="/T184296" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_119"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184296: Convert CloudVPS instances to stretch. </span></span></a><br />
<a href="/T184765" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_120"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184765: Back up ores-misc-01 to ores-staging-01</span></span></a><br />
<a href="/T184766" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_121"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184766: Convert ores-misc-01 to stretch</span></span></a></p>

<h4 class="remarkup-header">Migrating to revscoring 2.0</h4>

<p><a href="/T175180" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_122"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175180: Deploy ORES (revscoring 2.0)</span></span></a><br />
<a href="/T179712" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_123"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179712: ORES 500s when model_info lookup fails due to a key error</span></span></a><br />
<a href="/T179838" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_124"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179838: Update ORES deploy wheels with revscoring 2.0.9</span></span></a><br />
<a href="/T179296" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_125"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179296: Remove usages of nosetest and replace it with pytest</span></span></a></p>

<p><a href="/T163786" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_126"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163786: Make ORES documentation translatable</span></span></a><br />
<a href="/T174402" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_127"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174402: Review and fix file handle management in worker and celery processes</span></span></a><br />
<a href="/T175627" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_128"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175627: UK dictionary broken in production</span></span></a><br />
<a href="/T178668" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_129"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178668: /scores/&lt;context&gt; stopped working</span></span></a><br />
<a href="/T179064" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_130"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179064: ORES internal server error for edit with many added links</span></span></a><br />
<a href="/T179098" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_131"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179098: Deployment to canary causes an import error on docopt</span></span></a><br />
<a href="/T179509" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_132"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179509: id string in ORES does not include &quot;features&quot;</span></span></a><br />
<a href="/T179629" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_133"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179629: Add link to FAQ to ORES homepage</span></span></a><br />
<a href="/T179711" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_134"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179711: ORES 500 errors on a threshold lookup request</span></span></a><br />
<a href="/T179837" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_135"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179837: Deploy ORES early Nov 2017</span></span></a><br />
<a href="/T180115" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_136"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T180115: [regression] ORES filters are not available on French Wikipedia anymore</span></span></a><br />
<a href="/T180496" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_137"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T180496: Clean up ORES wheels Makefile</span></span></a><br />
<a href="/T181103" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_138"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181103: Announce the ORES FAQ</span></span></a><br />
<a href="/T181183" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_139"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181183: Improvements to ORES deployment documentation and process</span></span></a><br />
<a href="/T181187" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_140"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181187: ORES beta cluster config should be as close to production as possible</span></span></a><br />
<a href="/T182614" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_141"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182614: Investigate why ORES logs are being written to syslog despite explicit logging config.  Fix.</span></span></a><br />
<a href="/T184276" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_142"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184276: Beta Cluster ORES celery worker dies</span></span></a><br />
<a href="/T184282" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_143"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184282: Move beta cluster ORES to its own machine</span></span></a><br />
<a href="/T185148" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_144"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T185148: Update docs, monitoring, etc. for new labs servers</span></span></a></p>

<h3 class="remarkup-header">Editquality (Vandalism detection)</h3>

<p><a href="/T167968" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_145"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167968: Complete edit quality campaign for Hungarian Wikipedia</span></span></a><br />
<a href="/T174558" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_146"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174558: Deploy damaging/goodfaith model for svwiki</span></span></a><br />
<a href="/T176134" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_147"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T176134: Train &amp; test damaging/goodfaith model for eswiki</span></span></a><br />
<a href="/T176332" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_148"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T176332: Train/test edit quality models for eswikibooks</span></span></a><br />
<a href="/T177762" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_149"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T177762: Edit quality campaign for es.wikiquote</span></span></a><br />
<a href="/T178108" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_150"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178108: Edit quality campaign for Serbian Wikipedia</span></span></a><br />
<a href="/T180686" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_151"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T180686: Wikidata beta edit filters are showing every edit in watchlist as damaging</span></span></a><br />
<a href="/T181099" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_152"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181099: Train/test reverted model for Icelandic</span></span></a><br />
<a href="/T181848" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_153"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181848: Experiment with using English Wikipedia models on Simple English</span></span></a><br />
<a href="/T181849" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_154"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181849: Edit quality campaign for simple.wikipedia.org</span></span></a></p>

<h3 class="remarkup-header">Revscoring</h3>

<h4 class="remarkup-header">New languages</h4>

<p><a href="/T178524" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_155"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178524: Add language support for Icelandic</span></span></a><br />
<a href="/T182612" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_156"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182612: Implement language support for Catalan</span></span></a></p>

<p><a href="/T175180" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_157"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175180: Deploy ORES (revscoring 2.0)</span></span></a><br />
<a href="/T175627" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_158"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175627: UK dictionary broken in production</span></span></a><br />
<a href="/T177544" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_159"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T177544: Revscoring 2.0 takes up too much memory</span></span></a><br />
<a href="/T177636" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_160"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T177636: Reduce label_thresholds granularity</span></span></a><br />
<a href="/T179507" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_161"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179507: Compare coverage reports of migrating to pytest</span></span></a></p>

<h3 class="remarkup-header">ORES extension</h3>

<h4 class="remarkup-header">Extension rewrite</h4>

<p><a href="/T177421" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_162"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T177421: Cached thresholds should be invalidated for new model versions.</span></span></a><br />
<a href="/T178792" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_163"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T178792: ORESFetchScoreJob: RuntimeException No model available for [goodfaith]</span></span></a><br />
<a href="/T181334" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_164"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181334: Split Cache.php to different services</span></span></a><br />
<a href="/T181892" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_165"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181892: Rewrite Stats.php</span></span></a><br />
<a href="/T182111" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_166"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182111: Cached thresholds should be purged when model version is incremented</span></span></a><br />
<a href="/T182942" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_167"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T182942: Tests should have covered regression in T182936</span></span></a><br />
<a href="/T183468" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_168"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183468: Deprecate CheckModelVersions and integrate it with the extension workflow</span></span></a><br />
<a href="/T183762" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_169"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183762: oresscores is not working at all</span></span></a><br />
<a href="/T184127" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_170"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184127: Add models when initializing the table</span></span></a><br />
<a href="/T184140" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_171"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184140: Increase coverage of ORES extension</span></span></a><br />
<a href="/T184142" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_172"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184142: Refactor Scoring.php</span></span></a><br />
<a href="/T184554" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_173"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184554: Deprecate Extension:ORES &quot;beta&quot; mode</span></span></a><br />
<a href="/T184775" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_174"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T184775: How do I test my extension&#039;s maintenance scripts?</span></span></a></p>

<p><a href="/T154175" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_175"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T154175: Clean up failure ratio monitoring and set up an alarm when it goes more than a certain threshold</span></span></a><br />
<a href="/T175053" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_176"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175053: Make RCFilters compatible with both the old and new thresholds APIs</span></span></a><br />
<a href="/T176183" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_177"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T176183: Store draftquality data in ores extension</span></span></a><br />
<a href="/T176588" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_178"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T176588: Query action on API returns the rvcontinue value that point to itself causes infinite loop</span></span></a><br />
<a href="/T179107" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_179"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179107: ORES service erroring, in a way that throws exceptions in Extension:ORES</span></span></a><br />
<a href="/T179430" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_180"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179430: ORES extension failing to parse scoring response</span></span></a><br />
<a href="/T179596" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_181"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179596: Enable draftquality model in ORES extension for enwiki</span></span></a><br />
<a href="/T179602" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_182"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179602: Rewind revscoring 1/2 compatibility hacks</span></span></a><br />
<a href="/T179830" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_183"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179830: Notice: Undefined property: stdClass::$ores_damaging_threshold in /srv/mediawiki/php-1.31.0-wmf.6/extensions/ORES/includes/Hooks.php on line 602</span></span></a><br />
<a href="/T179861" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_184"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179861: Collect all data for draftquality model in enwiki</span></span></a><br />
<a href="/T179862" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_185"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T179862: Keep statistics about ores service hits for storing thresholds</span></span></a><br />
<a href="/T180026" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_186"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T180026: Drop oresc_rev_predicted_model index</span></span></a><br />
<a href="/T180045" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_187"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T180045: Review and deploy schema change on dropping oresc_rev_predicted_model index</span></span></a><br />
<a href="/T180450" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_188"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T180450: ORES thresholds for Wikidata is too strict</span></span></a><br />
<a href="/T180633" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_189"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T180633: ORES RC filters missing in beta cluster, fetching thresholds fails</span></span></a><br />
<a href="/T181006" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_190"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181006: Watchlist and RecentChanges failure due to ORES on frwiki and ruwiki</span></span></a><br />
<a href="/T181010" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_191"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181010: [Spike] Write reports about why Ext:ORES is helping cause server 500s and write tasks to fix</span></span></a><br />
<a href="/T181168" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_192"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181168: Replicate RC/WL failures in Beta</span></span></a><br />
<a href="/T181191" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_193"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T181191: Make ORES-consuming pages more robust to ORES errors</span></span></a><br />
<a href="/T183862" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_194"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T183862: Recent Changes is broken on Dutch Wikipedia Beta on Beta Cluster</span></span></a></p></div></content></entry><entry><title>Status update (October 6, 2017)</title><link href="/phame/live/8/post/77/status_update_october_6_2017/" /><id>https://phabricator.wikimedia.org/phame/post/view/77/</id><author><name>awight (Adam Wight)</name></author><published>2017-10-18T17:56:36+00:00</published><updated>2017-10-18T18:53:45+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p><em>New language support for Bengali, Greek, and Tamil.  New advance edit quality support for Albanian and Romanian.  We cleaned up the old &#039;reverted&#039; models where better support is available.  We&#039;re working on moving to a new dedicated cluster.  We improved some models by exploring new sources of signal and cleaning datasets.  We started work on JADE and presented on The Keilana Effect at Wikimania.</em></p>

<p>See more details below.</p>

<h3 class="remarkup-header">New language support</h3>

<p>We deployed basic edit quality support for Bengali, Greek, and Tamil.  We&#039;ve deployed advanced edit quality support for Albanian, Romanian.  Progress was made towards new models for Latvian, Croatian, Bosnian, and Spanish, but these aren&#039;t deployed yet.</p>

<p><a href="/T166049" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_196"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166049: Deploy reverted model for elwiki</span></span></a><br />
<a href="/T156357" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_197"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T156357: Deploy edit quality campaign for Romanian Wikipedia</span></span></a><br />
<a href="/T163009" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_198"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163009: Train/test damaging &amp; goodfaith models for Albanian Wikipedia</span></span></a><br />
<a href="/T162031" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_199"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162031: Add language support for Latvian (lv)</span></span></a><br />
<a href="/T166048" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_200"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166048: Deploy reverted model for tawiki</span></span></a><br />
<a href="/T170490" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_201"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T170490: Train reverted model for Bengali Wikipedia</span></span></a><br />
<a href="/T170491" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_202"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T170491: Train reverted model for Greek Wikipedia</span></span></a><br />
<a href="/T174572" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_203"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174572: Reverted model for hrwiki</span></span></a><br />
<a href="/T173087" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_204"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T173087: Add language support for Bosnian</span></span></a><br />
<a href="/T175628" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_205"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175628: Add LV dictionary to install. </span></span></a><br />
<a href="/T172046" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_206"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T172046: Add language support for Croatian (hr.wiki)</span></span></a><br />
<a href="/T131963" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_207"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T131963: Complete eswiki edit quality campaign</span></span></a><br />
<a href="/T174687" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_208"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T174687: Add language support for Serbian</span></span></a></p>

<p>See this full table for reference,<br />
<a href="https://www.mediawiki.org/wiki/ORES/Support_table" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/ORES/Support_table</a></p>

<h3 class="remarkup-header">Moving to the new, dedicated cluster</h3>

<p>Until now, we&#039;ve been running ORES on a shared Services cluster.  We&#039;re happy to announce that the ORES API will be served from a dedicated cluster, probably in a matter of weeks.  Stress tests showed some issues that we&#039;re still resolving.<br />
<a href="/T117560" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_209"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T117560: New Service Request: ORES</span></span></a><br />
<a href="/T169246" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_210"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169246: Stress/capacity test new ores* cluster</span></span></a></p>

<h3 class="remarkup-header">Cleaning up Wikilabels data</h3>

<p><a href="https://phabricator.wikimedia.org/p/Natalia/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_226"><span class="phui-tag-core phui-tag-color-person">@Natalia</span></a> found some systematic errors in our training data, and corrected several.  We also improved the structure of the labeling form to make it more difficult to make cognitive mistakes while labeling.</p>

<p><a href="https://meta.wikimedia.org/wiki/Research_talk:Automated_classification_of_edit_quality/Work_log/2017-07-24" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Research_talk:Automated_classification_of_edit_quality/Work_log/2017-07-24</a><br />
<a href="/T171491" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_211"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T171491: Unlabeled goodfaith observations are assumed &quot;false&quot; -- should be &quot;true&quot;</span></span></a><br />
<a href="/T171497" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_212"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T171497: Review training set to check strange examples of labels</span></span></a><br />
<a href="/T171493" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_213"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T171493: Change &quot;yes/no&quot; in damaging_goodfaith form to &quot;damaging/good&quot; and  &quot;good-faith/bad-faith&quot;</span></span></a></p>

<h3 class="remarkup-header">Maintenance and documentation</h3>

<p>We&#039;ve been working with Releng on git-lfs (Large File Storage) so that our repositories won&#039;t be so big but we&#039;ll still be able to maintain historical model versions.  <br />
<a href="/T171619" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_214"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T171619: [Epic] ORES should use a git large file plugin for storing serialized binaries</span></span></a></p>

<p>We were able to begin work with <a href="https://phabricator.wikimedia.org/p/srodlund/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_227"><span class="phui-tag-core phui-tag-color-person"><span class="phui-tag-dot phui-tag-color-grey"></span>@srodlund</span></a> to improve our technical and user documentation.</p>

<p>See <a href="https://www.mediawiki.org/wiki/ORES/FAQ" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/ORES/FAQ</a></p>

<h3 class="remarkup-header">Remove &quot;reverted&quot; model where advanced editquality models are available</h3>

<p>This was a noteworthy cleanup: on any wiki where the &quot;damaging&quot; and &quot;goodfaith&quot; models are available, these should be used instead of the &quot;reverted&quot; model.  To that end, we&#039;re removing the reverted model from these wikis.  <a href="https://lists.wikimedia.org/pipermail/ai/2017-July/000192.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">We held an RFC and no concerns were raised.</a><br />
<a href="/T171059" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_215"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T171059: [RfC] Should we remove all reverted models when there is a damaging one?</span></span></a><br />
<a href="/T172370" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_216"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T172370: Remove reverted models from editquality repo</span></span></a></p>

<h3 class="remarkup-header">More, better signal</h3>

<p>We experimented with adding Flagged Revs data to our training set<br />
<a href="https://meta.wikimedia.org/wiki/Research_talk:Automated_classification_of_edit_quality/Work_log/2017-07-26" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Research_talk:Automated_classification_of_edit_quality/Work_log/2017-07-26</a><br />
<a href="/T166235" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_217"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166235: Flagged revs approve model to fiwiki</span></span></a></p>

<p><a href="https://phabricator.wikimedia.org/p/Sumit/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_228"><span class="phui-tag-core phui-tag-color-person">@Sumit</span></a> ran several experiments to see if word sentiment analysis could improve our classifier health.  We were able to get marginal benefits and so implemented the strategy.<br />
<a href="/T167305" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_218"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167305: Experiment with Sentiment score feature for draftquality</span></span></a><br />
<a href="/T170177" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_219"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T170177: Test draftquality sentiment feature on Editquality</span></span></a></p>

<p><a href="https://phabricator.wikimedia.org/p/Natalia/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_229"><span class="phui-tag-core phui-tag-color-person">@Natalia</span></a> ran some experiments with including image-removals in the edit quality models and that didn&#039;t seem to affect performance. <br />
<a href="/T172049" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_220"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T172049: [Investigate] Get signal from adding/removing images</span></span></a></p>

<p><a href="https://phabricator.wikimedia.org/p/Nettrom/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_230"><span class="phui-tag-core phui-tag-color-person">@Nettrom</span></a> cleaned up the article quality data for English Wikipedia and that allowed us to boost fitness in strange cases (e.g. redirect pages)<br />
<a href="/T170434" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_221"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T170434: Improve cleaning of article quality assessment datasets</span></span></a></p>

<p><a href="https://phabricator.wikimedia.org/p/Ladsgroup/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_231"><span class="phui-tag-core phui-tag-color-person">@Ladsgroup</span></a> added strategies for scanning labels and descriptions for badwords. <br />
<a href="/T162617" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_222"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162617: Use &#039;informals&#039;, &#039;badwords&#039;, etc. in Wikidata feature set</span></span></a><br />
<a href="/T170834" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_223"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T170834: Add basic bad word check to Wikidata feature set</span></span></a></p>

<h3 class="remarkup-header">New model proposal: Draft topic prediction</h3>

<p>We&#039;re working on better ways for routing new page drafts to subject matter experts for review.  See our documentation pages.  We&#039;ll have datasets and some modeling experiments completed soon.</p>

<p><a href="https://meta.wikimedia.org/wiki/Research:Automatic_new_article_topics_suggestion" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Research:Automatic_new_article_topics_suggestion</a><br />
<a href="https://commons.wikimedia.org/wiki/File:New_article_routing.with_ORES.svg" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://commons.wikimedia.org/wiki/File:New_article_routing.with_ORES.svg</a></p>

<h3 class="remarkup-header">At Wikimania 2017</h3>

<p><a href="https://wikimania2017.wikimedia.org/wiki/Submissions/The_Keilana_Effect:_Visualizing_the_closing_coverage_gaps_with_ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://wikimania2017.wikimedia.org/wiki/Submissions/The_Keilana_Effect:_Visualizing_the_closing_coverage_gaps_with_ORES</a><br />
<a href="/T170015" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_224"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T170015: [Workshop] How can I get ORES for my wiki?</span></span></a></p>

<h3 class="remarkup-header">JADE schema and design</h3>

<p>We&#039;ve spent some time planning how we&#039;ll implement the JADE system, which enables ORES users to give us feedback and have that feedback integrated into score results.</p>

<p><a href="/T175192" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_225"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T175192: Design JADE scoring schema</span></span></a></p>

<p>For more info see the project&#039;s home page (<a href="https://www.mediawiki.org/wiki/JADE" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/JADE</a>) and sub-pages <a href="https://www.mediawiki.org/wiki/JADE/Schema" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/JADE/Schema</a> &amp; <a href="https://www.mediawiki.org/wiki/JADE/Implementations" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/JADE/Implementations</a></p>

<p>We&#039;re actively recruiting ORES stakeholders to be part of our working group.</p>

<h3 class="remarkup-header">Thresholds</h3>

<p>We&#039;re in the process of rolling out a major refactor of the core <tt class="remarkup-monospaced">revscoring</tt> library.  One of the most exciting new features is the ability for ORES API consumers to fine-tune the thresholds used to define prediction intervals, e.g. &quot;Very likely damaging&quot;.  These thresholds will be different on every wiki, and the new interface allows us to query statistics built into the model, and satisfy criteria like &quot;get me the threshold with the maximum filter rate, with a recall of at least 90%&quot;.</p>

<p>For more details see the blog post, <a href="/J68" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_195"><span class="phui-tag-core phui-tag-color-object">Blog Post: More/better model information and &quot;threshold optimizations&quot;</span></a></p></div></content></entry><entry><title>Wikilabels incident: Reversed diffs!</title><link href="/phame/live/8/post/69/wikilabels_incident_reversed_diffs/" /><id>https://phabricator.wikimedia.org/phame/post/view/69/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-08-31T14:02:26+00:00</published><updated>2017-09-02T13:44:45+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>Today, we discovered a major regression in Wikilabels.  We&#039;ve patched the issue and made an emergency deployment.  We also deleted some labels that were saved while the system was compromised.  In this post, we&#039;ll describe what happened.</p>

<p>In order to generate visual representations of edit diffs for labeling, we use the API behind Wikipedia.  The old way of asking the API to generate a diff was to use <tt class="remarkup-monospaced">?action=query&amp;prop=revisions&amp;revids=...&amp;rvdiffto=...</tt>.   The revision for the <tt class="remarkup-monospaced">rvdiffto</tt> parameter would appear on the left and the revision for the <tt class="remarkup-monospaced">revids</tt> parameter would appear on the right.  Recently, this method of gathering diffs from the API has been deprecated in favor of using <tt class="remarkup-monospaced">?action=compare&amp;torev=...&amp;fromrev=...</tt>  The revision for the <tt class="remarkup-monospaced">fromrev</tt> parameter would appear on the left and the revision for the <tt class="remarkup-monospaced">torev</tt> parameter would appear on the right.  But we got that backwards!  So the UI would show that an edit removing vandalism was performing it!  Or worse, an edit vandalizing Wikipedia would look like it was cleaning up vandalism.  So needless to say, we can&#039;t trust the labels saved while Wikilabels was in a compromised state.</p>

<p>We&#039;ve deleted the 36 labels that were submitted to editquality campaigns during this period.  All labels should be clean from this point forward.  Labelers will notice that worksets they though were completed are now incomplete.  We&#039;re very sorry for this inconvenience.</p>

<p>Special thanks to Paupass for making us aware of this issue on the <a href="https://www.mediawiki.org/wiki/Topic:Tx9hwftq823yfi7t" class="remarkup-link remarkup-link-ext" rel="noreferrer">ORES flow board</a>.  Sorry to Papuass, Ivi104, Lsanabria, and 4shadoww for deleting their hard work.  I hope it won&#039;t slow them down much.</p>

<p>These are the deleted labels for reference (CCO):</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code"> task_id | user_id  |         timestamp          |                           data                           
---------+----------+----------------------------+----------------------------------------------------------
  445742 | 38109688 | 2017-08-30 15:31:52.354537 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  445765 | 38109688 | 2017-08-30 15:32:30.420068 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: true, &quot;unsure&quot;: true}
  445812 | 38109688 | 2017-08-30 15:32:35.83588  | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  445877 | 38109688 | 2017-08-30 15:32:38.619456 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  446388 | 38109688 | 2017-08-30 15:33:48.12102  | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  446433 | 38109688 | 2017-08-30 15:34:12.903998 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  446562 | 38109688 | 2017-08-30 15:34:16.460096 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  446752 | 38109688 | 2017-08-30 15:34:21.357112 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  447128 | 38109688 | 2017-08-30 15:34:24.683543 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  447219 | 38109688 | 2017-08-30 15:34:52.929963 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  447228 | 38109688 | 2017-08-30 15:35:41.194955 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  370394 | 35332911 | 2017-08-30 22:25:45.732859 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  370724 | 35332911 | 2017-08-30 22:25:52.676656 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  370788 | 35332911 | 2017-08-30 22:26:36.25715  | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  371101 | 35332911 | 2017-08-30 22:27:38.908425 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  370368 | 35332911 | 2017-08-31 05:41:25.739392 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  433975 |     4075 | 2017-08-31 09:14:10.80414  | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  434380 |     4075 | 2017-08-31 09:15:04.198489 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  434389 |     4075 | 2017-08-31 09:15:16.009233 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  451268 | 14073293 | 2017-08-31 11:46:47.204268 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  451324 | 14073293 | 2017-08-31 11:47:11.079806 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  451688 | 14073293 | 2017-08-31 11:48:25.680257 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: true, &quot;unsure&quot;: true}
  452159 | 14073293 | 2017-08-31 11:48:50.324313 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: true, &quot;unsure&quot;: true}
  452203 | 14073293 | 2017-08-31 11:49:09.644101 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  452210 | 14073293 | 2017-08-31 11:49:34.381716 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  452315 | 14073293 | 2017-08-31 11:49:45.622214 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  452376 | 14073293 | 2017-08-31 11:51:33.128251 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  452431 | 14073293 | 2017-08-31 11:52:11.38513  | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: true, &quot;unsure&quot;: true}
  452581 | 14073293 | 2017-08-31 11:52:42.420328 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  452734 | 14073293 | 2017-08-31 11:53:02.748251 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  452967 | 14073293 | 2017-08-31 11:54:23.412685 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  453027 | 14073293 | 2017-08-31 11:55:02.218287 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  453162 | 14073293 | 2017-08-31 11:56:35.027953 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}
  453192 | 14073293 | 2017-08-31 11:56:46.138465 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  453237 | 14073293 | 2017-08-31 11:58:35.359926 | {&quot;goodfaith&quot;: false, &quot;damaging&quot;: true, &quot;unsure&quot;: false}
  453463 | 14073293 | 2017-08-31 11:58:46.129857 | {&quot;goodfaith&quot;: true, &quot;damaging&quot;: false, &quot;unsure&quot;: false}</pre></div></div></content></entry><entry><title>More/better model information and &quot;threshold optimizations&quot;</title><link href="/phame/live/8/post/68/more_better_model_information_and_threshold_optimizations/" /><id>https://phabricator.wikimedia.org/phame/post/view/68/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-08-29T22:41:07+00:00</published><updated>2017-08-31T21:22:46+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>Today, I&#039;m writing to announce a breaking change in ORES that will come out about a month from now.  It will only change how information about prediction models is stored and reported.  This information is used by some tools to set thresholds at specified levels of confidence (e.g. &quot;give me the threshold that gives 90% <a href="https://en.wikipedia.org/wiki/Precision_and_recall" class="remarkup-link remarkup-link-ext" rel="noreferrer">recall</a>&quot;).  In this blog post, I&#039;ll explain how this is currently done and how it will be done once we deploy the change.</p>

<p>While you read through these examples, you can experiment with <a href="https://ores.wikimedia.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wikimedia.org</a> (current behavior) and <a href="https://ores-misc.wmflabs.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores-misc.wmflabs.org</a> (new behavior).  These systems will stay in this state until we deploy the newer version to production (probably around Sept. 20th).</p>

<h2 class="remarkup-header">Why you need model_info</h2>

<p>So, let&#039;s say you are going to use ORES to supply your counter-vandalism tool with &quot;damaging&quot; edit predictions.  A prediction looks like this:</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">&quot;damaging&quot;: {
  &quot;score&quot;: {
    &quot;prediction&quot;: true,
    &quot;probability&quot;: {
      &quot;false&quot;: 0.04445904933523648,
      &quot;true&quot;: 0.9555409506647635
    }
  }
}</pre></div>

<p>That &quot;probability&quot; looks interesting.  You&#039;d be tempted to assume that it corresponds to some operational metric of model fitness.  E.g. &quot;There&#039;s a 95.5% chance that this edit is damaging!&quot; but regretfully, you&#039;d be wrong.  This &quot;probability&quot; is a useful measure of the model&#039;s confidence but not a useful measure of how the model will work against a stream of new edits from the recent changes feed.  It turns out that operational metrics for classifiers like this one are all drawn around thresholds.  In truth, you get ~95% <a href="https://en.wikipedia.org/wiki/Precision_and_recall" class="remarkup-link remarkup-link-ext" rel="noreferrer">precision</a> when you set a threshold at 93% &quot;probability&quot;.</p>

<p>This gets even more complicated when you want to set thresholds based on other statistics.  E.g. &quot;<a href="https://en.wikipedia.org/wiki/Precision_and_recall" class="remarkup-link remarkup-link-ext" rel="noreferrer">recall</a>&quot; which is the measure of how much of a target class you match.  In vandal patrolling work, we want to make sure that we catch most (if not all) of the vandalism.  There&#039;s steep tradeoffs in classifiers if we ask for perfection, so let&#039;s just set a high bar at 90% recall -- catching 90% of the most egregious vandalism.  Where should you set your &quot;probability&quot; threshold in order to do that?  It turns out that you should set your threshold at 0.09.  Using this, you&#039;ll have to review less than 1/5th of the incoming edits and you&#039;ll be guaranteed to catch 90% of the damaging edits.</p>

<p>The act of finding the confidence threshold at a specified fitness level is something that we call a <em>threshold optimization</em> and it&#039;s something that all of our users want to be able to do.  We&#039;ve been providing this information in a limited and inflexible way for a long time.  But this change will make gathering information about a model in a machine-readable way much much easier.</p>

<h2 class="remarkup-header">Current model_info behavior</h2>

<p>Currently, model_info is static.  You can request it by adding <tt class="remarkup-monospaced">?model_info</tt> to your URLs. E.g. <a href="https://ores.wikimedia.org/v2/scores/enwiki/damaging?model_info" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wikimedia.org/v2/scores/enwiki/damaging?model_info</a> This model information is generated at the time that the model is trained and includes a static set of statistics and <em>threshold optimizations</em>.  Here&#039;s an example of a <em>threshold optimization</em> for the English Wikipedia damaging model:</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">&quot;filter_rate_at_recall(min_recall=0.9)&quot;: {
  &quot;false&quot;: {
    &quot;filter_rate&quot;: 0.121,
    &quot;recall&quot;: 0.9,
    &quot;threshold&quot;: 0.547
  },
  &quot;true&quot;: {
    &quot;filter_rate&quot;: 0.743,
    &quot;recall&quot;: 0.908,
    &quot;threshold&quot;: 0.148
  }
}</pre></div>

<p>This block of data says that you can select all edits that score above 0.148 &quot;probability&quot; and expect to catch 91% of the damaging edits.</p>

<p>In order to provide useful thresholds for ORES users, we&#039;d specify them at the time of model train/test.  First, we had three thresholds specified: filter_rate_at_recall(min_recall=0.9), filter_rate_at_recall(min_recall=0.75), and recall_at_precision(min_precision=0.9).  These <em>threshold optimizations</em> corresponded roughly to &quot;needs review&quot;, &quot;likely damaging&quot;, and &quot;almost certainly damaging&quot; respectively.</p>

<p>After working with the <a href="https://www.mediawiki.org/wiki/Collaboration/Team" class="remarkup-link remarkup-link-ext" rel="noreferrer">Collaboration Team</a> on the new <a href="https://www.mediawiki.org/wiki/Help:New_filters_for_edit_review" class="remarkup-link remarkup-link-ext" rel="noreferrer">RC Filters</a> system for patrolling Special:RecentChanges, the list of <em>threshold optimizations</em> ballooned to include: recall_at_fpr(max_fpr=0.1), recall_at_precision(min_precision=0.15), recall_at_precision(min_precision=0.45), recall_at_precision(min_precision=0.6), recall_at_precision(min_precision=0.75), recall_at_precision(min_precision=0.98), recall_at_precision(min_precision=0.99), and recall_at_precision(min_precision=0.995).  This was getting out of control.</p>

<p>So I started work on a new task <a href="/T162217" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_232"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162217: Implement &quot;thresholds&quot;, deprecate &quot;pile of tests_stats&quot;</span></span></a>.  See the description for a discussion I had with <a href="https://phabricator.wikimedia.org/p/Catrope/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_237"><span class="phui-tag-core phui-tag-color-person">@Catrope</span></a> to make sure I understood what he and his team needed.</p>

<h2 class="remarkup-header">New model_info behavior</h2>

<p>So, I hadn&#039;t planned on this work, but I thought dealing with it was a really good idea.  After all, it would make our users&#039; life easier and my life easier because I wouldn&#039;t need to re-train the models every time that a new threshold optimization was needed.  I could also take this opportunity to implement some important <a href="/tag/revscoring/" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_239"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-briefcase" data-meta="0_238" aria-hidden="true"></span>revscoring</span></a> stuff I&#039;d been putting off.  E.g. <a href="/T160223" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_233"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T160223: Store the detailed system information inside of model files.  </span></span></a>, <a href="/T172566" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_234"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T172566: Include label-specific schemas with model_info</span></span></a>, and <a href="/T163711" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_235"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163711: Use our own scoring models in `tune` utility</span></span></a>.  A couple weekends, a holiday, and a hackathon later, I had something that worked.  Fun story: I actually fully implemented the system several times and decided to refactor and re-engineer the model_info system entirely.  This allowed me to iteratively reduce complexity and coupled-ness.</p>

<p>The new system can currently be tested at <a href="https://ores-misc.wmflabs.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores-misc.wmflabs.org</a>.  When we ask for <tt class="remarkup-monospaced">?model_info</tt>, we see something that&#039;s a little different.  I&#039;ll make some time in other blog posts to talk about &#039;environment&#039; and &#039;score_schema&#039;.  For now, I just want to talk about &#039;statistics&#039; that replaces &#039;test_stats&#039;.</p>

<h3 class="remarkup-header">Digging into &quot;statistics&quot;</h3>

<p>The first thing that is different is that we now generate aggregate statistics across output labels.</p>

<p><strong>old</strong> (<a href="https://ores.wikimedia.org/v2/scores/enwiki/draftquality/?model_info=test_stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">query</a>):</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">&quot;f1&quot;: {
  &quot;OK&quot;: 0.99,
  &quot;attack&quot;: 0.136,
  &quot;spam&quot;: 0.586,
  &quot;vandalism&quot;: 0.341
}</pre></div>

<p><strong>new</strong> (<a href="https://ores-misc.wmflabs.org/v2/scores/enwiki/draftquality/?model_info=statistics" class="remarkup-link remarkup-link-ext" rel="noreferrer">query</a>):</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">&quot;f1&quot;: {
  &quot;labels&quot;: {
    &quot;OK&quot;: 0.974,
    &quot;attack&quot;: 0.136,
    &quot;spam&quot;: 0.586,
    &quot;vandalism&quot;: 0.341
  },
  &quot;macro&quot;: 0.509,
  &quot;micro&quot;: 0.962
}</pre></div>

<p>A macro-average of the label statistics is just a simple average across the reported statistic for each label.  (0.974 + 0.136 + 0.586 + 0.341) / 4 = 0.509.  The micro-average is a weighted by the number of observations.  Since the &quot;OK&quot; class if far more common than any other and gets a relatively high f1 score, the micro-average is much higher than the macro-average.</p>

<p>All types of statistics now have these aggregates by default.</p>

<h3 class="remarkup-header">Digging into &quot;thresholds&quot;</h3>

<p>OK so what about the thresholds thing that is the whole premise of this blog post?  Well, I think you&#039;re going to like this.  I&#039;ve built a light-weight querying system into the abstract concept of &quot;thresholds&quot; that will allow you to get whatever threshold you like -- so long as your strategy for getting it involves optimizing one statistic (&quot;maximum filter_rate&quot;) and holding another constant (&quot;@ recall &gt;= 0.9&quot;).</p>

<p><tt class="remarkup-monospaced">?model_info=statistics.thresholds.true.&quot;maximum filter_rate @ recall &gt;= 0.9&quot;</tt>:</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">&quot;thresholds&quot;: {
  &quot;true&quot;: [
    {
      &quot;!f1&quot;: 0.883,
      &quot;!precision&quot;: 0.996,
      &quot;!recall&quot;: 0.794,
      &quot;accuracy&quot;: 0.797,
      &quot;f1&quot;: 0.233,
      &quot;filter_rate&quot;: 0.77,
      &quot;fpr&quot;: 0.206,
      &quot;match_rate&quot;: 0.23,
      &quot;precision&quot;: 0.134,
      &quot;recall&quot;: 0.901,
      &quot;threshold&quot;: 0.09295862121864444
    }
  ]
}</pre></div>

<p>Here, you can see that we get the same information back, but we&#039;re allowed to choose arbitrary optimizations and have the system report back to us where we should place our thresholds.</p>

<p>I asked <a href="https://phabricator.wikimedia.org/p/Catrope/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_240"><span class="phui-tag-core phui-tag-color-person">@Catrope</span></a> to put together a task for me to demo how I&#039;d just this system to get the optimizations he needs.  See <a href="https://phabricator.wikimedia.org/T173019" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_236"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T173019</span></span></a>.  This will require me to request multiple optimizations at the same time.  Here&#039;s the full URL:</p>

<p><tt class="remarkup-monospaced">?model_info=statistics.thresholds.true.&quot;maximum filter_rate @ recall &gt;= 0.9&quot;|statistics.thresholds.true.&quot;maximum recall @ precision &gt;= 0.15&quot;</tt></p>

<p>Which gives us:</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">&quot;thresholds&quot;: {
  &quot;true&quot;: [
    {
      &quot;!f1&quot;: 0.883,
      &quot;!precision&quot;: 0.996,
      &quot;!recall&quot;: 0.794,
      &quot;accuracy&quot;: 0.797,
      &quot;f1&quot;: 0.233,
      &quot;filter_rate&quot;: 0.77,
      &quot;fpr&quot;: 0.206,
      &quot;match_rate&quot;: 0.23,
      &quot;precision&quot;: 0.134,
      &quot;recall&quot;: 0.901,
      &quot;threshold&quot;: 0.09295862121864444
    },
    {
      &quot;!f1&quot;: 0.906,
      &quot;!precision&quot;: 0.993,
      &quot;!recall&quot;: 0.833,
      &quot;accuracy&quot;: 0.834,
      &quot;f1&quot;: 0.256,
      &quot;filter_rate&quot;: 0.81,
      &quot;fpr&quot;: 0.167,
      &quot;match_rate&quot;: 0.19,
      &quot;precision&quot;: 0.151,
      &quot;recall&quot;: 0.838,
      &quot;threshold&quot;: 0.14750910213671917
    }
  ]
}</pre></div>

<p>So there you have it!  There&#039;s lots more you can do with this model_info system, but we&#039;ll need to save that for another blog post.  For now, let us know if you have concerns with the new threshold optimization scheme.</p>

<h2 class="remarkup-header">The deployment plan</h2>

<p>This announcement blog post is the first step of our deployment plan.  We&#039;ll be reaching out to <a href="https://phabricator.wikimedia.org/p/Catrope/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_241"><span class="phui-tag-core phui-tag-color-person">@Catrope</span></a>, <a href="https://phabricator.wikimedia.org/p/Petrb/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_242"><span class="phui-tag-core phui-tag-color-person">@Petrb</span></a>, <a href="https://phabricator.wikimedia.org/p/Ragesoss/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_243"><span class="phui-tag-core phui-tag-color-person">@Ragesoss</span></a>, and other developers who use ORES to make sure that they know this change is coming over the next week.  A week from now (Sept. 5th), we&#039;ll deploy the new model_info system to <a href="https://ores.wmflabs.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wmflabs.org</a> and <a href="https://ores-beta.wmflabs.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores-beta.wmflabs.org</a>.  Then we&#039;ll wait at least two weeks and confirm that adaptations have been made to the tools that we know about before finally deploying to <a href="https://ores.wikimedia.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wikimedia.org</a> (~Sept. 20th)</p></div></content></entry><entry><title>Laughing ORES to death with regular expressions and fake threads</title><link href="/phame/live/8/post/64/laughing_ores_to_death_with_regular_expressions_and_fake_threads/" /><id>https://phabricator.wikimedia.org/phame/post/view/64/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-08-17T21:29:19+00:00</published><updated>2017-09-06T11:50:25+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p><em>At 1100 UTC on June 23rd, ORES started to struggle.  Within a half hour, it had fully choked and could no longer respond to any requests.  It took us 10 hours to diagnose the problem, solve it, and consider it solved.  We learned some valuable lessons when studying and <a href="https://wikitech.wikimedia.org/wiki/Incident_documentation/20170623-ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">addressing this issue</a>.</em></p>

<p>You can&#039;t prevent <em>bad things</em> from happening.  Something will always go wrong.  So you do the best that you can to handle <em>bad things</em> gracefully.  In a distributed processing environment like ORES&#039; cluster, the worst thing that could happen is to have a process block for forever.  So, preparing for bad things means you use timeouts for just about everything.  So far, this has been a great strategy and it makes it so that, at worst, only a few requests out of many will fail when something goes wrong.  Regretfully, for this downtime event we had one of the worst <em>bad things</em> happen, and at the same time we discovered that our timeouts were not capable of stopping deep processes that go rogue in a specific way.  In this blog post, I&#039;ll explain what happened.</p>

<h2 class="remarkup-header">Recursive backtracking in a regular expression</h2>

<p>Many of the models deployed in ORES use regular expressions to extract signal about the quality of an edit.  For example, we use them to match &quot;badwords&quot; (curse words, racial slurs, and other words that are commonly used to cause offense) and &quot;informals&quot; (linguistic colloquialisms like &quot;haha&quot; or &quot;lol&quot; or &quot;wtf&quot;).  One such regular expression that we used to match informal laughing in Spanish language looked like this: <tt class="remarkup-monospaced">/j+[eaiou]+(j+[aeiou]*)*/</tt>  It is intended to match strings like &quot;jajajaja&quot; or &quot;jijiji&quot;.</p>

<p>In <a href="https://es.wikipedia.org/w/index.php?title=Relieve_terrestre&amp;type=revision&amp;diff=100032572" class="remarkup-link remarkup-link-ext" rel="noreferrer">this edit</a> of Spanish Wikipedia, an IP editor added a very long string of repeated &quot;JAJAJJAJAJAJAJAJAJ&quot; to the article for &quot;<a href="https://es.wikipedia.org/wiki/Relieve_terrestre" class="remarkup-link remarkup-link-ext" rel="noreferrer">Terrain</a>&quot;.  This is exactly what the regular expression was designed to match.  But there was a problem.  This regular expression was poorly designed in that it caused a catastrophic backtracking pattern.  Every time it would match the entire sequence of &quot;JAJAJJAJAJAJAJAJAJ&quot; and then fail when encountered &quot;...JAJAJlentos...&quot;, it would re-attempt the entire match dropping just one &quot;JA&quot; from the middle.  This problem doesn&#039;t really matter for any short sequences.  But for one very long one (and this one was 4155 chars long == 230 repetitions of &quot;JAJAJJAJAJAJAJAJAJ&quot;), it would have taken <strong>days</strong> to finish.  The plot below demonstrates how badly things break down at only 14 repetitions.</p>

<p><div class="phabricator-remarkup-embed-layout-left"><a href="https://phab.wmfusercontent.org/file/data/5mivx6qjsjqbk7tpnvxf/PHID-FILE-kss5o7mcpw6cdqk5lcnv/Pathological_backtracking_regex_%28ORES%29.png" class="phabricator-remarkup-embed-image" data-sigil="lightboxable" data-meta="0_244"><img src="https://phab.wmfusercontent.org/file/data/ztnt4ayifaooux3ejl2t/PHID-FILE-dzol2kxu5cwjao77wzcm/preview-Pathological_backtracking_regex_%28ORES%29.png" width="220" height="157.09375" alt="Pathological_backtracking_regex_(ORES).png (914×1 px, 24 KB)" /></a></div><br />
<div href="/F8926897" target="_blank" rel="noreferrer" class="phabricator-remarkup-embed-layout-link " data-sigil="lightboxable" data-meta="0_245" data-mustcapture="1"><span class="visual-only phui-icon-view phui-font-fa fa-file-text-o phabricator-remarkup-embed-layout-icon" data-meta="0_246" aria-hidden="true"></span><span class="phabricator-remarkup-embed-layout-info-block"><span class="phabricator-remarkup-embed-layout-name">pathological_backtracking_regex.svg</span><span class="phabricator-remarkup-embed-layout-info">58 KB</span></span><a class="phabricator-remarkup-embed-layout-download" href="https://phab.wmfusercontent.org/file/download/wividryrk54y4hl24ak7/PHID-FILE-dn6bwluqt7dhskk55ox2/pathological_backtracking_regex.svg">Download</a></div></p>

<h2 class="remarkup-header">Where were the timeouts?</h2>

<p>Things like this happen.  When operating in a distributed processing environment, you should always have timeouts on everything so that if something goes haywire, it doesn&#039;t take everything down.  Regretfully, matching a regular expression is not just a special opportunity for pathological backtracking, but also an opportunity to learn hard lessons about safe timeouts.</p>

<p>We have timeouts in ORES in a few strategic places.  E.g. if a single scoring job takes longer than 15 seconds (extracting informal &quot;JAJAJA&quot; is part of a scoring job), then it is supposed to time out.  But for some reason, we weren&#039;t timing out during regular expression matching.  I started digging into the library we use to implement execution timeouts, and what I learned was horrifying.</p>

<p>Most timeouts in python are implemented with &quot;threads&quot;.  I put &quot;threads&quot; in quotes because threads in python are a convenient abstraction and not true concurrency.  Python&#039;s <a href="https://wiki.python.org/moin/GlobalInterpreterLock" class="remarkup-link remarkup-link-ext" rel="noreferrer">Global Interprer Lock</a>(GIL) is an internal mutex that prevents truly concurrent threading.  In order to get around this, python uses separate processes to implement concurrency.  I&#039;m not going to get into the details of the GIL or process based concurrency,  but suffice it to say, if you use an external C library to execute a regular expression match on a string, any thread that is trying to implement a timeout is going to get locked up and totally fail to do what it is supposed to do!</p>

<p>Because our threading-based timeouts were completely disabled by this long regular expression match, our &quot;precaching&quot; system (makes sure we score every edit and put the score in the cache ASAP) was slowly taking us down.  Every time the problematic diff was requested, it would render yet another worker unreachable.  Because ORES would just fail to respond, our precaching system registered a Connection Timeout and would simply retry the request.  Eventually capacity would decay as our ~200 workers were locking at 100% CPU one by one.</p>

<p>Luckily, there&#039;s an easy solution to this problem in <a href="https://stackoverflow.com/questions/492519/timeout-on-a-function-call" class="remarkup-link remarkup-link-ext" rel="noreferrer">unix signals</a>.  By having the operating system help us manage our timeouts, we could stop relying on python threads to behave sanely in order for us to recover from future rogue processes.</p>

<h2 class="remarkup-header">So, you fixed it right?</h2>

<p>First, I should thank <a href="https://phabricator.wikimedia.org/p/ssastry/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_247"><span class="phui-tag-core phui-tag-color-person">@ssastry</span></a> for his quick work identifying the pathological backtracking problem and submitting a fix.  We also completed an emergency deployment of ORES that implemented the use of Unix signals and we&#039;ve been humming along, scoring all of the things, ever since.</p></div></content></entry><entry><title>Announcing the Scoring Platform team</title><link href="/phame/live/8/post/62/announcing_the_scoring_platform_team/" /><id>https://phabricator.wikimedia.org/phame/post/view/62/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-07-21T16:46:59+00:00</published><updated>2017-07-25T02:10:20+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p><em>The Wikimedia Foundation’s new Scoring Platform team, led by Aaron Halfaker, will be working on democratizing access to AI, developing new types of AI predictions, and pushing the state of the art with regards to ethical practice of AI development.</em></p>

<p><div class="phabricator-remarkup-embed-layout-left"><a href="https://phab.wmfusercontent.org/file/data/xexdhcd6o7l2cahmn5k5/PHID-FILE-kznozkkmwsltnsn7xlhl/wikipedia_glasses.jpg" class="phabricator-remarkup-embed-image" data-sigil="lightboxable" data-meta="0_248"><img src="https://phab.wmfusercontent.org/file/data/vtwsvkgd5knzzmdsbeh6/PHID-FILE-4jmxgf4rjlf6gygstu5k/preview-wikipedia_glasses.jpg" width="220" height="135.09375" alt="wikipedia_glasses.jpg (393×640 px, 44 KB)" /></a></div><br />
Illustration by Mun May Tee-Galloway, CC BY-SA 4.0.</p>

<p>On January 12, 2015, an editor by the name of Blank123456789 <a href="https://en.wikipedia.org/w/index.php?title=Dog_intelligence&amp;diff=642215445" class="remarkup-link remarkup-link-ext" rel="noreferrer">noted</a> that “LLAMAS GROW ON TREES” in the article about <a href="https://en.wikipedia.org/wiki/Dog_intelligence" class="remarkup-link remarkup-link-ext" rel="noreferrer">Dog intelligence</a>.  Within a second, the edit was flagged by an algorithm as potentially problematic.</p>

<p>Another Wikipedia editor named IronGargoyle saw this flagged edit in an advance curation tool called Huggle.  With a glance, he was able to identify the edit as problematic and strike it down. This whole interaction took a matter of seconds. A vandal vandalizes, and a patroller supported by advanced vandalism detection artificial intelligences (AIs) sees the problem and corrects it.</p>

<p>AIs make the work of maintaining <a href="https://wikimediafoundation.org/wiki/Our_projects" class="remarkup-link remarkup-link-ext" rel="noreferrer">massive encyclopedias, dictionaries, databases, and more</a> much easier by making a lot of large scale tasks (like counter-vandalism and <a href="https://en.wikipedia.org/wiki/Wikipedia:WikiProject_assessment" class="remarkup-link remarkup-link-ext" rel="noreferrer">article quality assessment</a>) much easier and quicker to spot and handle.  Historically, the AIs that have helped Wikipedians were built and maintained by volunteers. While these systems filled a critical infrastructural role, they were generally only available for the English Wikipedia and did not scale well.</p>

<p>Over the past few years, I have been working alongside a large group of volunteers on a core technology that makes basic AI support for wiki-work much more accessible to non-AI specialist developers.  Named “<a href="https://www.mediawiki.org/wiki/ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">ORES</a>,” it is an artificial intelligence service that makes predictions about which edits are vandalism, which new page creations are problematic, and which articles are ready to be nominated for Featured status. (See our past posts about <a href="https://blog.wikimedia.org/2015/11/30/artificial-intelligence-x-ray-specs/" class="remarkup-link remarkup-link-ext" rel="noreferrer">how it works</a> and <a href="https://blog.wikimedia.org/2017/03/07/the-keilana-effect/" class="remarkup-link remarkup-link-ext" rel="noreferrer">measuring content gaps with ORES</a>)</p>

<p>Without a doubt, the project has been a breakaway success. The beta feature has 26,000 active users and over 20 third party tools, is actively running in production, and has received positive write-ups in <em><a href="https://www.wired.com/2015/12/wikipedia-is-using-ai-to-expand-the-ranks-of-human-editors/" class="remarkup-link remarkup-link-ext" rel="noreferrer">Wired</a>, <a href="https://www.technologyreview.com/s/544036/artificial-intelligence-aims-to-make-wikipedia-friendlier-and-better/" class="remarkup-link remarkup-link-ext" rel="noreferrer">MIT Tech Review</a>, and <a href="http://www.bbc.com/news/technology-34982570" class="remarkup-link remarkup-link-ext" rel="noreferrer">the BBC</a></em>. As a result, we’ve become a leader in conversations around detecting and mitigating biases, and have built collaborations with researchers at <a href="https://en.wikipedia.org/wiki/University_of_California,_Berkeley" class="remarkup-link remarkup-link-ext" rel="noreferrer">UC-Berkeley</a>, <a href="https://en.wikipedia.org/wiki/University_of_Minnesota" class="remarkup-link remarkup-link-ext" rel="noreferrer">UMN</a>, <a href="https://en.wikipedia.org/wiki/Carnegie_Mellon_University" class="remarkup-link remarkup-link-ext" rel="noreferrer">CMU</a>, <a href="https://en.wikipedia.org/wiki/%C3%89cole_nationale_sup%C3%A9rieure_des_t%C3%A9l%C3%A9communications_de_Bretagne" class="remarkup-link remarkup-link-ext" rel="noreferrer">Télécom Bretagne</a>, and <a href="https://en.wikipedia.org/wiki/Northwestern_University" class="remarkup-link remarkup-link-ext" rel="noreferrer">Northwestern</a>.</p>

<h3 class="remarkup-header">Meet the Scoring Platform team</h3>

<p><div class="phabricator-remarkup-embed-layout-left"><a href="https://phab.wmfusercontent.org/file/data/fxgzkyjhyp55dzvftqlc/PHID-FILE-ruxb4lekutoehzrtqiwu/1024px-Halfaker%2C_Aaron_Sept_2013.jpg" class="phabricator-remarkup-embed-image" data-sigil="lightboxable" data-meta="0_249"><img src="https://phab.wmfusercontent.org/file/data/zrbhkxzcb3ebqlku5txu/PHID-FILE-p3w4tqyfbuqhckkjgr5b/preview-1024px-Halfaker%2C_Aaron_Sept_2013.jpg" width="220" height="146.73828125" alt="1024px-Halfaker,_Aaron_Sept_2013.jpg (683×1 px, 80 KB)" /></a></div><br />
<a href="https://commons.wikimedia.org/wiki/File:Halfaker,_Aaron_Sept_2013.jpg" class="remarkup-link remarkup-link-ext" rel="noreferrer">Photo</a> by Myleen Hollero/Wikimedia Foundation, CC BY-SA 3.0.</p>

<p>The new <a href="https://mediawiki.org/wiki/Wikimedia_Scoring_Platform_team" class="remarkup-link remarkup-link-ext" rel="noreferrer">Scoring Platform team</a> is led by <a href="https://en.wikipedia.org/wiki/Aaron_Halfaker" class="remarkup-link remarkup-link-ext" rel="noreferrer">Aaron Halfaker</a>, a principal research scientist who authored a series of studies into <a href="https://meta.wikimedia.org/wiki/Research:The_Rise_and_Decline" class="remarkup-link remarkup-link-ext" rel="noreferrer">Wikipedia’s newcomer decline</a> and designed <a href="https://en.wikipedia.org/wiki/WP:Snuggle" class="remarkup-link remarkup-link-ext" rel="noreferrer">Snuggle</a>, a newcomer socialization support tool.  ORES is the next item on Dr. Halfaker’s research agenda.  He hypothesizes that by enabling a broader set of people to build powerful, AI-driven wiki tools, some of Wikipedia’s fundamental <a href="https://en.wikipedia.org/wiki/Sociotechnical_system" class="remarkup-link remarkup-link-ext" rel="noreferrer">socio-technical</a> problems may become much easier to solve.</p>

<p><div class="phabricator-remarkup-embed-layout-left"><a href="https://phab.wmfusercontent.org/file/data/732slv7rjtqc7fxowgrt/PHID-FILE-7ray6jsdzkciacbhup5f/Fawiki500k_celebration_by_Behdad_Abedi_%2810%29.jpg" class="phabricator-remarkup-embed-image" data-sigil="lightboxable" data-meta="0_250"><img src="https://phab.wmfusercontent.org/file/data/6fsrtzir3pby46hah7gc/PHID-FILE-vn64hyrjskw7lrt7plvv/preview-Fawiki500k_celebration_by_Behdad_Abedi_%2810%29.jpg" width="220" height="146.73828125" alt="Fawiki500k_celebration_by_Behdad_Abedi_(10).jpg (683×1 px, 64 KB)" /></a></div><br />
<a href="https://commons.wikimedia.org/wiki/File:Fawiki500k_celebration_by_Behdad_Abedi_(10).jpg" class="remarkup-link remarkup-link-ext" rel="noreferrer">Photo</a> by Mardetanha, CC BY-SA 4.0.</p>

<p>Amir Sarabadani will be continuing his work as a quasi-volunteer and contractor for our peer organization, Wikimedia Germany.  Amir has developed several bots and bot-building utilities that are used to maintain content in Wikipedia and <a href="https://en.wikipedia.org/wiki/Wikidata" class="remarkup-link remarkup-link-ext" rel="noreferrer">Wikidata</a>.  Amir has been a core contributor since the early days of the volunteer-driven “Revision Scoring as a Service” project, and is the primary author of our insanely popular Beta feature—the ORES Review Tool.</p>

<p><div class="phabricator-remarkup-embed-layout-left"><a href="https://phab.wmfusercontent.org/file/data/h46rx6gtfkctcdtup43s/PHID-FILE-qmtuta5z4fqaxr2bshph/Adam_Roses_Wight.jpg" class="phabricator-remarkup-embed-image" data-sigil="lightboxable" data-meta="0_251"><img src="https://phab.wmfusercontent.org/file/data/xtphyiukjrlqfeufpset/PHID-FILE-rzwlpg3lhuiamqakk2gb/preview-Adam_Roses_Wight.jpg" width="176.21103117506" height="220" alt="Adam_Roses_Wight.jpg (417×334 px, 51 KB)" /></a></div><br />
<a href="https://commons.wikimedia.org/wiki/File:Adam_Roses_Wight.jpg" class="remarkup-link remarkup-link-ext" rel="noreferrer">Photo</a> by Adam Wight, CC BY-SA 3.0.</p>

<p>As of this month, the team is welcoming their first full-time, budgeted engineer, Adam Wight.  He has worked with the Wikimedia Foundation’s fundraising team since 2012, volunteered for ORES and the Education Program. Outside of computers, he’s done a few eclectic things like helping to start “<a href="http://sustainability.berkeley.edu/food" class="remarkup-link remarkup-link-ext" rel="noreferrer">The Local</a>” food co-op and People’s University, an open-air school on subjects ranging from philosophy to the history of <a href="https://en.wikipedia.org/wiki/Adventure_playground" class="remarkup-link remarkup-link-ext" rel="noreferrer">adventure playgrounds</a> and practical blacksmithing.  Adam is currently working out the <a href="https://www.mediawiki.org/wiki/Meta-ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">details</a> of an auditing system that will allow humans to more effectively critique ORES’ predictions.</p>

<h3 class="remarkup-header">Where we plan to go next</h3>

<p>In the next year, the Scoring Platform team to work in three new directions:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><strong>Democratizing access to AI.</strong> We’ll increase the availability of advanced AIs to more wiki communities.  Small, but growing communities need AI support the most, so we’ll be targeting these <a href="https://meta.wikimedia.org/wiki/Community_Engagement/Defining_Emerging_Communities" class="remarkup-link remarkup-link-ext" rel="noreferrer">emerging communities</a> to make sure they are well supported.</li>
<li class="remarkup-list-item"><strong>Developing new types of AI predictions.</strong>  The team is currently experimenting with new types of AIs for supporting different types of Wikipedians’ work.  We’re collaborating with external researchers to develop prediction models.</li>
<li class="remarkup-list-item"><strong>Pushing the state of the art with regards to ethical practice of AI development.</strong>  AIs can be scary in all sorts of ways.  They can perpetuate biases in hidden ways, silence the voices of those who don’t conform, and simply operate at speeds and scales far exceeding mere humans.  We’re building a human-driven auditing system for ORES’ predictions so that human contributors will have a new and powerful way to keep ORES in check.</li>
</ul>

<p>Until now, ORES was primarily a volunteer-driven project.  With minimal financial support, a <a href="https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service/Archived" class="remarkup-link remarkup-link-ext" rel="noreferrer">ragtag team</a> was able to build a production-level service that supports 29 languages and 35 different Wikimedia project wikis.  The <a href="https://www.mediawiki.org/wiki/ORES_review_tool" class="remarkup-link remarkup-link-ext" rel="noreferrer">ORES Review Tool</a> (a simple tool for helping with counter-vandalism work) has been a breakaway success, with over 26k editors installing the beta feature before it was enabled by default.</p>

<h3 class="remarkup-header">How to learn more and get involved</h3>

<p>The Scoring Platform team welcomes collaboration and volunteers to get involved with the project.  See the <a href="https://www.mediawiki.org/wiki/Wikimedia_Scoring_Platform_team" class="remarkup-link remarkup-link-ext" rel="noreferrer">team’s page</a> and <a href="https://phabricator.wikimedia.org/phame/blog/view/8/" class="remarkup-link" rel="noreferrer">our technical blog</a> for more information about how to get involved.  See <a href="https://www.mediawiki.org/wiki/ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">ORES’ documentation</a> for more information about using the service or getting support for your wiki.  Or join the larger community of people interested in applying AI to make wikis work better via our <a href="https://lists.wikimedia.org/mailman/listinfo/ai" class="remarkup-link remarkup-link-ext" rel="noreferrer">mailing list</a> and IRC channel (<a href="http://webchat.freenode.net/?channels=wikimedia-ai" class="remarkup-link remarkup-link-ext" rel="noreferrer">#wikimedia-ai on freenode</a>).</p></div></content></entry><entry><title>Status update (July 11th, 2017)</title><link href="/phame/live/8/post/58/status_update_july_11th_2017/" /><id>https://phabricator.wikimedia.org/phame/post/view/58/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-07-12T22:44:27+00:00</published><updated>2017-07-21T16:53:25+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p><em>Two outages with documentation. Revscoring 2.0 coming with better model information and &quot;thresholds&quot;. New support for Romanian, Albanian, Tamil, Greek, and Bengali. We&#039;re officially welcoming <a href="https://phabricator.wikimedia.org/p/awight/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_323"><span class="phui-tag-core phui-tag-color-person">@awight</span></a> to the team!</em></p>

<p>Hey folks!</p>

<p>As of July 1st, we are officially the Scoring Platform team.  We&#039;re welcoming Adam Wight (<a href="https://phabricator.wikimedia.org/p/awight/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_324"><span class="phui-tag-core phui-tag-color-person">@awight</span></a>) to the team officially.</p>

<p>The last ~month was very productive, but we had two major production issues.  See <a href="https://wikitech.wikimedia.org/wiki/Incident_documentation/20170613-ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">20170613-ORES</a> and <a href="https://wikitech.wikimedia.org/wiki/Incident_documentation/20170623-ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">20170623-ORES</a>.  As you will see below, there&#039;s a series of tasks that address problems that were related to these issues.</p>

<p>Despite dealing with production issues, we&#039;ve been able to get a very substantial change to the <a href="/tag/revscoring/" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_326"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-briefcase" data-meta="0_325" aria-hidden="true"></span>revscoring</span></a> library merged.  This change will make accessing information about models (build environment, test statistics, scoring thresholds, etc.) much easier.  This will cause a breaking change in ORES UI so we&#039;ll be making an announcement when we roll it out.  Stay tuned.</p>

<p>We&#039;ve also increased our language and model coverage substantially.  We even built and deployed a totally new type of model to help out French Wikisource!</p>

<h3 class="remarkup-header">New team stuff</h3>

<p>So with the new fiscal year, we&#039;re a new team.  We&#039;re working on an announcement to be posted on the WMF blog.  That should be coming out soon.  See <a href="https://phabricator.wikimedia.org/T169755" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_252"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169755</span></span></a>.  Most of the new team stuff focused on getting Adam all of the rights he needed to do ORES deploys and other work.</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T168917" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_253"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168917: Get Adam all the rights</span></span></a></li>
<li class="remarkup-list-item"><a href="/T168443" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_254"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168443: Grant AWight CR+2 on scoring platform repos</span></span></a></li>
<li class="remarkup-list-item"><a href="/T169915" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_255"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169915: Create scoring-internal mailing list for Scoring Platform team</span></span></a></li>
<li class="remarkup-list-item"><a href="/T168442" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_256"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168442: Grant AWight accounts on ores production clusters</span></span></a></li>
</ul>

<h3 class="remarkup-header">ORES downtime</h3>

<p>We had two major downtime events with ORES.  One of these (<a href="https://wikitech.wikimedia.org/wiki/Incident_documentation/20170613-ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">20170613-ORES</a>) was not our fault, but we still set up better monitoring (<a href="https://phabricator.wikimedia.org/T167830" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_257"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167830</span></span></a>) so that, when it happens again, we can fix it more quickly.  The second event (<a href="https://wikitech.wikimedia.org/wiki/Incident_documentation/20170623-ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">20170623-ORES</a>) was due to a deeply problematic regular expression pattern that had ~ a 1 in a billion chance of causing catastrophic failure.  We both fixed the regular expression (<a href="https://phabricator.wikimedia.org/T168888" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_258"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168888</span></span></a>) and fixed the timeout that didn&#039;t catch the out-of-control regex match (<a href="https://phabricator.wikimedia.org/T168965" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_259"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168965</span></span></a>)</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T167819" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_260"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167819: ORES in eqiad is unhappy</span></span></a></li>
<li class="remarkup-list-item">{T168773}</li>
<li class="remarkup-list-item"><a href="/T167830" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_261"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167830: Extend icinga check to catch 500 errors like those of the 20170613 incident</span></span></a></li>
<li class="remarkup-list-item"><a href="/T169367" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_262"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169367: [Investigate] some revisions frequently return TaskRevokedError </span></span></a></li>
<li class="remarkup-list-item"><a href="/T168888" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_263"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168888: Fix degenerate regular expressions for matching &quot;hahaha&quot; and &quot;jajaja&quot;</span></span></a></li>
<li class="remarkup-list-item"><a href="/T168965" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_264"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168965: Why don&#039;t timeouts work during long regular expression matching?</span></span></a></li>
<li class="remarkup-list-item"><a href="/T170205" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_265"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T170205: Add test to ensure timeout of functions taking too long</span></span></a></li>
<li class="remarkup-list-item"><a href="/T168889" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_266"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168889: Rebuild all of the models for ORES (new regexes)</span></span></a></li>
</ul>

<h3 class="remarkup-header">New language support</h3>

<p>We were lucky to have a lot of volunteers working with us this month so that allowed us to make a lot of progress towards expanding support to more wikis. Both the Albanian and Romanian Wikipedias finished their labeling campaigns so we&#039;ll be able to deploy advanced support to them soon (<a href="https://phabricator.wikimedia.org/T163010" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_267"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163010</span></span></a>, <a href="https://phabricator.wikimedia.org/T156517" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_268"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T156517</span></span></a>).  We now have some of the basic language assets for Tamil so we should be able to build up basic support for that Wikipedia soon (<a href="https://phabricator.wikimedia.org/T166052" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_269"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166052</span></span></a>).  We also implemented an article quality model for Turkish Wikipedia (<a href="https://phabricator.wikimedia.org/T164671" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_270"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164671</span></span></a>) thanks to lots of work by <a href="https://phabricator.wikimedia.org/p/Mavrikant/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_327"><span class="phui-tag-core phui-tag-color-person">@Mavrikant</span></a>.  We developed a new strategy for cross-language badword/informal detection and addressed some lingustic overlap between English and Hungarian Wikipedia (<a href="https://phabricator.wikimedia.org/T167231" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_271"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167231</span></span></a>, <a href="https://phabricator.wikimedia.org/T165872" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_272"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165872</span></span></a>).  We implemented a page-level OCR model for French Wikisource (somewhat like article quality, but more about the quality of machine reader transcriptions) (<a href="https://phabricator.wikimedia.org/T167196" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_273"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167196</span></span></a>).  Finally, we deployed the ORES Review Tool to French Wikipedia (<a href="https://phabricator.wikimedia.org/T165044" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_274"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165044</span></span></a>)</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T163010" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_275"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163010: Complete Albanian Wikipedia editquality campaign</span></span></a></li>
<li class="remarkup-list-item"><a href="/T156517" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_276"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T156517: Complete Romanian Wikipedia edit quality campaign</span></span></a></li>
<li class="remarkup-list-item"><a href="/T166052" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_277"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166052: Language assets for Tamil</span></span></a></li>
<li class="remarkup-list-item"><a href="/T164671" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_278"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164671: Implement wp10 model for trwiki</span></span></a></li>
<li class="remarkup-list-item"><a href="/T167231" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_279"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167231: Remove other non-badwords from huwiki model. </span></span></a></li>
<li class="remarkup-list-item"><a href="/T165872" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_280"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165872: Don&#039;t use &quot;ha&quot; as an informal in hungarian</span></span></a></li>
<li class="remarkup-list-item"><a href="/T167196" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_281"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167196: Implement page_level (OCR) model for frwikisource</span></span></a></li>
<li class="remarkup-list-item"><a href="/T165044" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_282"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165044: Deploy ORES review tool on French Wikipedia</span></span></a></li>
</ul>

<h3 class="remarkup-header">Data release -- Monthly Article Quality predictions (English Wikipedia)</h3>

<p>This was a long time coming.  We&#039;ve got the data that allowed us to measure <a href="https://blog.wikimedia.org/2017/03/07/the-keilana-effect/" class="remarkup-link remarkup-link-ext" rel="noreferrer">the coverage gap of articles about Women Scientists in Wikipedia</a> hosted in labs (<a href="https://phabricator.wikimedia.org/T146718" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_283"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T146718</span></span></a>).  That means the table can be queried directly from Quarry.  See <a href="https://quarry.wmflabs.org/query/20169" class="remarkup-link remarkup-link-ext" rel="noreferrer">this demo query</a>.</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T146718" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_284"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T146718: [Discuss] Hosting the monthly article quality dataset on labsDB</span></span></a></li>
</ul>

<h3 class="remarkup-header">New features for ORES/revscoring</h3>

<p>Prompted by concerns raised by <a href="https://phabricator.wikimedia.org/p/Catrope/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_330"><span class="phui-tag-core phui-tag-color-person">@Catrope</span></a> from the <a href="/tag/collaboration-team-triage/" class="phui-tag-view phui-tag-type-shade phui-tag-disabled phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_329"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-users" data-meta="0_328" aria-hidden="true"></span>Collaboration-Team-Triage</span></a>, we have been working on a better way to represent information about a model (<a href="https://phabricator.wikimedia.org/T162217" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_285"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162217</span></span></a>): build environment, statistics, prediction thresholds, etc.  We&#039;ve even built a way to allow for querying the thresholds of a model that we refer to as &quot;threshold optimizations&quot;.  This refactoring gave us an opportunity to address some other outstanding wants with regards to revscoring -- e.g. storing more information about the build environment (<a href="https://phabricator.wikimedia.org/T160223" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_286"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T160223</span></span></a>) and cleaning up our &quot;tune&quot; utility (<a href="https://phabricator.wikimedia.org/T163711" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_287"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163711</span></span></a>).</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T162217" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_288"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162217: Implement &quot;thresholds&quot;, deprecate &quot;pile of tests_stats&quot;</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163711" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_289"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163711: Use our own scoring models in `tune` utility</span></span></a></li>
<li class="remarkup-list-item"><a href="/T169157" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_290"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169157: revscoring train_model dies without --observations</span></span></a></li>
<li class="remarkup-list-item"><a href="/T160223" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_291"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T160223: Store the detailed system information inside of model files.  </span></span></a></li>
</ul>



<h3 class="remarkup-header">Wikilabels UX improvement &amp; maintenance.</h3>

<p>Thanks to <a href="https://phabricator.wikimedia.org/p/Jan_Dittrich/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_331"><span class="phui-tag-core phui-tag-color-person"><span class="phui-tag-dot phui-tag-color-grey"></span>@Jan_Dittrich</span></a> and <a href="https://phabricator.wikimedia.org/p/Pginer-WMF/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_332"><span class="phui-tag-core phui-tag-color-person">@Pginer-WMF</span></a>&#039;s feedback, we&#039;ve been working on addressing some user-experience issues.    These were mostly fixes to language to make the functionality of the system more clear (<a href="https://phabricator.wikimedia.org/T167079" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_292"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167079</span></span></a>, <a href="https://phabricator.wikimedia.org/T138736" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_293"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T138736</span></span></a>).  We also brought Wikilabels down for a short period of time on Tuesday July 11th for scheduled database maintenance (<a href="https://phabricator.wikimedia.org/T169933" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_294"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169933</span></span></a>).</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T169933" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_295"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169933: Notify Wikilabels users of short downtime on July 11 @ 1400 UTC</span></span></a></li>
<li class="remarkup-list-item"><a href="/T167061" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_296"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167061: Early June 2017 Wiki labels deploy</span></span></a></li>
<li class="remarkup-list-item"><a href="/T167079" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_297"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167079: Initial set of UX fixes for Wiki labels</span></span></a></li>
<li class="remarkup-list-item"><a href="/T138736" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_298"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T138736: Rename &quot;abandon&quot; button to something less confusing</span></span></a></li>
</ul>

<h3 class="remarkup-header">ORES Review Tool improvements</h3>

<p>We finished up some patchsets that were blocked for a long time on some fixes to core MediaWiki.  This allowed us to fixed highlighting in Special:RecentChanges and Special:Watchlist (<a href="https://phabricator.wikimedia.org/T155903" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_299"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T155903</span></span></a>, <a href="https://phabricator.wikimedia.org/T155930" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_300"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T155930</span></span></a>).</p>

<h3 class="remarkup-header">General ORES maintenance</h3>

<p>We&#039;ve done a bunch of maintenance to ORES to solve a variety of issues that cropped up.  E.g. improving tests (<a href="https://phabricator.wikimedia.org/T168007" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_301"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168007</span></span></a>), solving a regression in the basic ORES ui (<a href="https://phabricator.wikimedia.org/T149117" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_302"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T149117</span></span></a>), fixing our new precaching system (<a href="https://phabricator.wikimedia.org/T168674" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_303"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168674</span></span></a>) and enabling it to work with the new <a href="https://wikitech.wikimedia.org/wiki/EventStreams" class="remarkup-link remarkup-link-ext" rel="noreferrer">EventStreams</a> feed (<a href="https://phabricator.wikimedia.org/T166046" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_304"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166046</span></span></a>).</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T168007" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_305"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168007: Add API tests to ORES CI</span></span></a></li>
<li class="remarkup-list-item"><a href="/T149117" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_306"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T149117: ORES UI is broken</span></span></a></li>
<li class="remarkup-list-item"><a href="/T168920" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_307"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168920: ORES 500&#039;s on integers that can&#039;t be processed</span></span></a></li>
<li class="remarkup-list-item"><a href="/T168674" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_308"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168674: ORES POST precaching always fails with 500</span></span></a></li>
<li class="remarkup-list-item"><a href="/T149118" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_309"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T149118: ORES UI doesn&#039;t handle API errors</span></span></a></li>
<li class="remarkup-list-item"><a href="/T166046" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_310"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166046: Switch ores precache to use new EventStreams</span></span></a></li>
<li class="remarkup-list-item"><a href="/T162184" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_311"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162184: ORES swagger doc based API requests do not work</span></span></a></li>
</ul>

<h3 class="remarkup-header">Misc operations work, versions and styling</h3>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T169129" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_312"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169129: Remove custom apt repo from ores labs boxes</span></span></a></li>
<li class="remarkup-list-item"><a href="/T167612" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_313"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167612: Make names for Wiki-AI diffusion repos consistent</span></span></a></li>
<li class="remarkup-list-item"><a href="/T165716" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_314"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165716: No new data on ores_classification on beta labs since march memory issue</span></span></a></li>
<li class="remarkup-list-item"><a href="/T169577" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_315"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169577: Apply mediawiki core styling convention on javascript files of ores</span></span></a></li>
<li class="remarkup-list-item"><a href="/T167604" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_316"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167604: upgrade pytz to 2017.2 for revscoring</span></span></a></li>
<li class="remarkup-list-item"><a href="/T167303" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_317"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167303: Update Travis CI from precise</span></span></a></li>
<li class="remarkup-list-item"><a href="/T167149" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_318"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T167149: Test if ORES celery can use the unix socket</span></span></a></li>
<li class="remarkup-list-item"><a href="/T168904" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_319"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T168904: Minor cleanup in Makefiles</span></span></a></li>
<li class="remarkup-list-item"><a href="/T169809" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_320"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169809: Set up larger ores-compute instance</span></span></a></li>
<li class="remarkup-list-item"><a href="/T169164" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_321"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169164: ORES puppet error on labs boxes, unable to set user to &quot;deploy-service&quot;</span></span></a></li>
<li class="remarkup-list-item"><a href="/T169473" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_322"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T169473: Add flake8 to travis checks</span></span></a></li>
</ul></div></content></entry><entry><title>Status update (June 3rd, 2017)</title><link href="/phame/live/8/post/52/status_update_june_3rd_2017/" /><id>https://phabricator.wikimedia.org/phame/post/view/52/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T20:24:26+00:00</published><updated>2017-07-21T16:55:04+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p><em>Updates now coming to the phame blog!  We made presentations and gathered new collaborators at the Wikimedia Hackathon 2017 in Vienna. ORES is back in api.php. Wikilabels has stats.  ORES in CODFW fell over for a while, but it&#039;s back.</em></p>

<p>Hey folks,</p>

<p>I&#039;ll be starting to post updates here on the phame blog from now on, but if you&#039;d prefer to be notified via the mailing lists we used to post to, that&#039;s OK.  I&#039;ll make sure that the highlights and the link to these posts gets pushed there too.</p>

<p>We had a big presence at the Wikimedia Hackathon 2017 in Vienna.  We kicked off a lot of new language focused collaborations and we deployed a new Item Quality model for Wikidata.</p>

<p>French and Finnish Wikipedias now have advance edit quality prediction support!</p>

<p>ORES is available through api.php again via rvprop=orescores and rcprop=oresscores.</p>

<p>Wiki labels now has a new stats reporting interface.  Check out <a href="https://labels.wmflabs.org/stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://labels.wmflabs.org/stats</a></p>

<p>We had a major hiccup when failing over to CODFW, but we worked it out and ORES is very happy again.</p>

<p>See the sections below for details.</p>

<h3 class="remarkup-header">Labeling campaigns</h3>

<p>We deployed a new edit quality labeling campaign to English Wiktionary(<a href="https://phabricator.wikimedia.org/T165876" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_333"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165876</span></span></a>) and we&#039;re looking for someone who can work as a liaison for this task.  We&#039;ve also deployed secondary labeling campaigns to Finnish Wikipedia(<a href="https://phabricator.wikimedia.org/T166558" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_334"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166558</span></span></a>) and Turkish Wikipedia(<a href="https://phabricator.wikimedia.org/T164672" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_335"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164672</span></span></a>).  These secondary campaigns help us improve ORES accuracy.</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T165867" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_336"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165867: Edit quality campaign for English Wiktionary</span></span></a></li>
<li class="remarkup-list-item"><a href="/T166558" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_337"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166558: Edit quality campaign (v2) for Finnish Wikipedia</span></span></a></li>
<li class="remarkup-list-item"><a href="/T164672" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_338"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164672: 2016 edit quality labeling campaign for trwiki</span></span></a></li>
</ul>

<h3 class="remarkup-header">Outreach &amp; comms</h3>

<p>We hosted a session at the Wikimedia Hackathon to tell people about ORES and show how to work with us to get support for your local wiki(<a href="https://phabricator.wikimedia.org/T165397" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_339"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165397</span></span></a>).  We also worked with the Collaboration Team to announce that ORES Review Tool would not be enabled by default and the New Filters would be deployed as a beta feature(<a href="https://phabricator.wikimedia.org/T163153" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_340"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163153</span></span></a>).</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T165397" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_341"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165397: Hack session: Who wants some ORES? </span></span></a></li>
<li class="remarkup-list-item"><a href="/T163153" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_342"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163153: Communicate new beta prefs and changes to ORES users specifically</span></span></a></li>
</ul>



<h3 class="remarkup-header">New development</h3>

<p>Lots of things here.  In our modeling library, we implemented the basics of Greek and Bengali language assets so that we can start working on prediction models(<a href="https://phabricator.wikimedia.org/T166793" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_343"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166793</span></span></a>, <a href="https://phabricator.wikimedia.org/T162620" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_344"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162620</span></span></a>).  After talking to people at the Wikimedia Hackathon about peculiar language overlap, we implemented a regex exclusions strategy(<a href="https://phabricator.wikimedia.org/T166793" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_345"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166793</span></span></a>) that will allow us to clearly state that &quot;ha&quot; is not laughing in Hungarian or Italian, but it is in a lot of other contexts.</p>

<p>We also spent some time exploring the overlap of the &quot;damaging&quot; and &quot;goodfaith&quot; models on Wikipedia(<a href="https://phabricator.wikimedia.org/T163995" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_346"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163995</span></span></a>).  We were able to show that there&#039;s useful overlap that will allow editors working on newcomer socialization to find goodfaith newcomer who are running into trouble.  The Collaboration Team adjusted the thresholds in New Filters in response to our analysis(<a href="https://phabricator.wikimedia.org/T164621" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_347"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164621</span></span></a>).</p>

<p>Using data from Wiki labels(<a href="https://phabricator.wikimedia.org/T157495" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_348"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T157495</span></span></a>), we trained a basic <em>item quality</em> model for Wikidata(<a href="https://phabricator.wikimedia.org/T164862" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_349"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164862</span></span></a>) and demonstrated it at the Wikimedia Hackathon(<a href="https://phabricator.wikimedia.org/T166054" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_350"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166054</span></span></a>).  We used data from Wiki labels(<a href="https://phabricator.wikimedia.org/T130261" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_351"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T130261</span></span></a>, <a href="https://phabricator.wikimedia.org/T163012" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_352"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163012</span></span></a>) to build advanced edit quality models for French and Finnish Wikipedia(<a href="https://phabricator.wikimedia.org/T130282" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_353"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T130282</span></span></a>, <a href="https://phabricator.wikimedia.org/T163013" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_354"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163013</span></span></a>) and those are now deployed in ORES(<a href="https://phabricator.wikimedia.org/T166047" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_355"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166047</span></span></a>).</p>

<p>We implemented a new stats reporting interface in Wiki labels(<a href="https://phabricator.wikimedia.org/T139956" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_356"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T139956</span></span></a>) and announced it (<a href="https://phabricator.wikimedia.org/T166529" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_357"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166529</span></span></a>).  This interface makes it easier for people managing campaigns in Wiki labels to track progress.  It&#039;s a long time coming.  Props to <a href="https://phabricator.wikimedia.org/p/Ladsgroup/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_407"><span class="phui-tag-core phui-tag-color-person">@Ladsgroup</span></a> for doing a bunch of work to make it happen.</p>

<p>Finally, we implemented a new &quot;score_revisions&quot; utility that makes it quick and easy to generate scores for a set of revisions using the ORES service(<a href="https://phabricator.wikimedia.org/T164547" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_358"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164547</span></span></a>).  This is really useful for researchers who want lots of scores and would like to avoid taking down ORES.  Personally, I&#039;ve been using it to audit ORES.</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T122727" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_359"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T122727: Greek language assets</span></span></a></li>
<li class="remarkup-list-item"><a href="/T166793" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_360"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166793: Implement regex exclusion strategy for badwords/informals</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163995" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_361"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163995: Lack of intersection between damaging &amp; goodfaith for English Wikipedia</span></span></a></li>
<li class="remarkup-list-item"><a href="/T164621" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_362"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164621: Adjust ORES levels on en.wiki to get better overlap between good faith and damage</span></span></a></li>
<li class="remarkup-list-item"><a href="/T164862" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_363"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164862: Train a basic item quality based on edit quality for Wikidata</span></span></a></li>
<li class="remarkup-list-item"><a href="/T166054" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_364"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166054: Visual demonstration of item quality model for Wikidata</span></span></a></li>
<li class="remarkup-list-item"><a href="/T157495" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_365"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T157495: Complete Wikidata item quality campaign</span></span></a></li>
<li class="remarkup-list-item"><a href="/T130261" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_366"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T130261: Complete frwiki edit quality campaign</span></span></a></li>
<li class="remarkup-list-item"><a href="/T130282" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_367"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T130282: Train/test damaging and goodfaith models for frwiki</span></span></a></li>
<li class="remarkup-list-item"><a href="/T166047" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_368"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166047: Deploy damaging &amp; goodfaith models to frwiki</span></span></a></li>
<li class="remarkup-list-item"><a href="/T139956" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_369"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T139956: Move campaign progress interface to wikilabels repo/system</span></span></a></li>
<li class="remarkup-list-item"><a href="/T166529" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_370"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T166529: Announce campaign stats and change wikilable phabricator tasks properly</span></span></a></li>
<li class="remarkup-list-item"><a href="/T162620" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_371"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162620: Add language support for Bengali</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163012" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_372"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163012: Train/test damaging &amp; goodfaith model for Finnish Wikipedia</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163013" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_373"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163013: Complete editquality campaign for Finnish Wikipedia</span></span></a></li>
<li class="remarkup-list-item"><a href="/T164547" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_374"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164547: Implement score_revisions utility</span></span></a></li>
</ul>



<h3 class="remarkup-header">Maintenance and robustness</h3>

<p>We did a major deployment of ORES in mid-April(<a href="https://phabricator.wikimedia.org/T162892" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_375"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162892</span></span></a>) that had some serious problems in CODFW, but not EQIAD which was super confusing (<a href="https://phabricator.wikimedia.org/T163950" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_376"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163950</span></span></a>), so we re-routed traffic to EQIAD(<a href="https://gerrit.wikimedia.org/r/350487" class="remarkup-link remarkup-link-ext" rel="noreferrer">350487</a>).  While investigating, we found out that some timeouts(<a href="https://phabricator.wikimedia.org/T163944" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_377"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163944</span></span></a>) and server errors(<a href="https://phabricator.wikimedia.org/T163171" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_378"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163171</span></span></a>, <a href="https://phabricator.wikimedia.org/T163764" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_379"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163764</span></span></a>, <a href="https://phabricator.wikimedia.org/T163798" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_380"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163798</span></span></a>) were due to the same problem: There were two servers in CODFW that we didn&#039;t know existed so they weren&#039;t getting new deployment and were poisoning our worker queue with old code!</p>

<p>We also fixed a couple of regressions that popped up in the ORES Review Tool while new work was being done on New Filters (<a href="https://phabricator.wikimedia.org/T165011" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_381"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165011</span></span></a>, <a href="https://phabricator.wikimedia.org/T164984" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_382"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164984</span></span></a>).  We fixed some weird tokenization issues due to diacritics in Bengali not being handled correctly(<a href="https://phabricator.wikimedia.org/T164767" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_383"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164767</span></span></a>).</p>

<p>We re-enabled ORES in api.php(<a href="https://phabricator.wikimedia.org/T163687" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_384"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163687</span></span></a>).  Props to <a href="https://phabricator.wikimedia.org/p/Tgr/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_408"><span class="phui-tag-core phui-tag-color-person"><span class="phui-tag-dot phui-tag-color-orange"></span>@Tgr</span></a> for making this happen.</p>

<p>We fixed some issues with ORES swagger documentation(<a href="https://phabricator.wikimedia.org/T162184" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_385"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162184</span></span></a>) and some UI issues in Wiki labels related to button colors(<a href="https://phabricator.wikimedia.org/T163222" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_386"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163222</span></span></a>) and confusing error messages(<a href="https://phabricator.wikimedia.org/T138563" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_387"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T138563</span></span></a>).</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T162892" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_388"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162892: Deploy ORES mid-April</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163950" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_389"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163950: Investigate failed deploy to CODFW</span></span></a></li>
<li class="remarkup-list-item"><a href="https://gerrit.wikimedia.org/r/350487" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://gerrit.wikimedia.org/r/350487</a></li>
<li class="remarkup-list-item"><a href="/T163944" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_390"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163944: Timeouts on CODFW</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163171" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_391"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163171: ORES server error</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163764" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_392"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163764: Prod: Bad Request (400) on testwiki test models</span></span></a></li>
<li class="remarkup-list-item"><a href="/T165011" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_393"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T165011: Global default &#039;hard&#039; is invalid for field oresDamagingPref</span></span></a></li>
<li class="remarkup-list-item"><a href="/T164984" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_394"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164984: Notice: Undefined property: stdClass::$ores_damaging_threshold in /srv/mediawiki/php-1.30.0-wmf.1/extensions/ORES/includes/Hooks.php on line 547</span></span></a></li>
<li class="remarkup-list-item"><a href="/T164767" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_395"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164767: Fix bengali tokenization in deltas</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163687" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_396"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163687: Re-enable ORES data in action API</span></span></a></li>
<li class="remarkup-list-item"><a href="/T162184" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_397"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T162184: ORES swagger doc based API requests do not work</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163222" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_398"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163222: On labels.wmflabs.org, make the blue buttons more visible when they have been selected</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163798" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_399"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163798: Fix errorred state for missing rev_ids</span></span></a></li>
<li class="remarkup-list-item"><a href="/T138563" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_400"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T138563: Improve error message for suppressed changes shown in WikiLabels</span></span></a></li>
</ul>

<h3 class="remarkup-header">Documentation</h3>

<p>We finished off some data-flow diagrams for ORES(<a href="https://phabricator.wikimedia.org/T154441" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_401"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T154441</span></span></a>).  As part of transitioning to a Wikimedia Foundation team (Scoring Platform!  Woot!), we&#039;ve moved all the documentation for ORES and our team to Mediawiki.org(<a href="https://phabricator.wikimedia.org/T164991" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_402"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164991</span></span></a>).  Also, as part of the Tech Ops experimentation with failovers across datacenters, we updated our grafana metrics tracking to split metrics by datacenter(<a href="https://phabricator.wikimedia.org/T163212" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_403"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163212</span></span></a>).  This helped us quite a bit with diagnosing the deployment issues we discussed in the last section.</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="/T154441" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_404"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T154441: Draw rough data flow diagrams for ORES</span></span></a></li>
<li class="remarkup-list-item"><a href="/T164991" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_405"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T164991: Move ORES docs to mediawiki</span></span></a></li>
<li class="remarkup-list-item"><a href="/T163212" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_406"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T163212: Update grafana to split metrics by eqiad and codfw</span></span></a></li>
</ul>

<p>That&#039;s all folks.  I hope you enjoyed the new format!</p></div></content></entry><entry><title>Join my Reddit AMA about Wikipedia and ethical, transparent AI</title><link href="/phame/live/8/post/51/join_my_reddit_ama_about_wikipedia_and_ethical_transparent_ai/" /><id>https://phabricator.wikimedia.org/phame/post/view/51/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T18:35:42+00:00</published><updated>2017-07-21T16:55:54+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p><em>I wanted to let you know about an upcoming experimental Reddit AMA (&quot;ask me anything&quot;) chat we have planned. It will focus on artificial intelligence on Wikipedia and how we&#039;re working to counteract vandalism while also making life better for newcomers.</em></p>

<p>We plan to hold this chat on June 1st at 21:00 UTC/14:00 PST in the /r/iAMA subreddit[1]. I&#039;d love to answer any questions you have about these topics questions, and I&#039;ll send a follow-up email to this thread shortly before the AMA begins.</p>

<hr class="remarkup-hr" />

<p>For those who don&#039;t know who I am, I create artificial intelligences[2] that support the volunteers who edit Wikipedia[3]. I&#039;ve been fascinated by the ways that crowds of volunteers build massive, high quality information resources like Wikipedia for over ten years.</p>

<p>For more background, I research and then design technologies that make it easier to spot vandalism in Wikipedia—which helps support the hundreds of thousands of editors who make productive contributions. I also think a lot about the dynamics between communities and new users—and ways to make communities inviting and welcoming to both long-time community members and newcomers who may not be aware of community norms.  For a quick sampling of my work, check out my most impactful research paper about Wikipedia[3], some recent coverage of my work from *Wired*[4], or check out the master list of my projects on my WMF staff user page[5], the documentation for the technology team I run[9], or the home page for Wikimedia Research[8].</p>

<p>This AMA, which I&#039;m doing with with the Foundation&#039;s Communications department, is somewhat of an experiment. The intended audience for this chat is people who might not currently be a part of our community but have questions about the way we work—as well as potential research collaborators who might want to work with our data or tools. Many may be familiar with Wikipedia but not the work we do as a community behind the scenes.</p>

<p>I&#039;ll be talking about the work I&#039;m doing with the ethics of AI and how we think about artificial intelligence on Wikipedia, and ways we’re working to counteract vandalism on the world’s largest crowdsourced source of knowledge—like the ORES extension[6], which you may have seen highlighting possibly problematic edits on your watchlist and in RecentChanges.</p>

<p>I’d love for you to join this chat and ask questions.  If you do not or prefer not to use Reddit, we will also be taking questions on ORES&#039; MediaWiki talk page[7] and posting answers to both threads.</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://www.reddit.com/r/IAmA/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.reddit.com/r/IAmA/</a></li>
<li class="remarkup-list-item"><a href="https://en.wikipedia.org/wiki/Artificial_intelligence" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.org/wiki/Artificial_intelligence</a></li>
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/ORES</a></li>
<li class="remarkup-list-item"><a href="http://www-users.cs.umn.edu/~halfak/publications/The_Rise_and_Decline/halfaker13rise-preprint.pdf" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://www-users.cs.umn.edu/~halfak/publications/The_Rise_and_Decline/halfaker13rise-preprint.pdf</a></li>
<li class="remarkup-list-item"><a href="https://www.wired.com/2015/12/wikipedia-is-using-ai-to-expand-the-ranks-of-human-editors/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.wired.com/2015/12/wikipedia-is-using-ai-to-expand-the-ranks-of-human-editors/</a></li>
<li class="remarkup-list-item"><a href="https://en.wikipedia.org/wiki/User:Halfak_(WMF)" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.org/wiki/User:Halfak_(WMF)</a></li>
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/Extension:ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/Extension:ORES</a></li>
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/Talk:ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/Talk:ORES</a></li>
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/Wikimedia_Research" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/Wikimedia_Research</a></li>
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/Wikimedia_Scoring_Platform_team" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/Wikimedia_Scoring_Platform_team</a></li>
</ol>

<p>-<a href="https://phabricator.wikimedia.org/p/Halfak/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_409"><span class="phui-tag-core phui-tag-color-person">@Halfak</span></a> <br />
Principal Research Scientist @ WMF<br />
User:EpochFail / User:Halfak (WMF)</p>

<p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2017-May/000163.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2017-May/000163.html</a>)</p></div></content></entry><entry><title>Status update (April 14th, 2017)</title><link href="/phame/live/8/post/50/status_update_april_14th_2017/" /><id>https://phabricator.wikimedia.org/phame/post/view/50/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T18:30:08+00:00</published><updated>2017-08-31T20:25:15+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>In this update, I&#039;m going to change some things up to try and make this update easier for you to consume.  The biggest change you&#039;ll notice is that I&#039;ve broken up the [#] references in each section.  I hope that saves you some scrolling and confusion.  You&#039;ll also notice that I have changed the subject line from &quot;Revision scoring&quot; to &quot;Scoring Platform&quot; because it&#039;s now clear that, come July, I&#039;ll be leading a new team with that name at the Wikimedia Foundation.  There&#039;ll be an announcement about that coming once our budget is finalized.  I&#039;ll try to keep this subject consistent for the foreseeable future so that your email clients will continue to group the updates into one big thread.</p>

<h3 class="remarkup-header">Deployments &amp; maintenance:</h3>

<p>In this cycle, we&#039;ve gotten better at tracking our deployments and noting what changes do out with each deployment.  You can click on the phab task for a deployment and observe the sub-tasks to find out what was deployed.  We had 3 deployments for ORES since mid-march[1,2,3].  We&#039;ve had two deployments to Wikilabels[4,5] and we&#039;ve added a maintenance notices for a short period of downtime that&#039;s coming up on April 21st[6,7].</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160279" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_411"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_410" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160279</span></a> -- Deploy ores in prod (Mid-March)</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160638" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_413"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_412" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160638</span></a> -- Deploy ORES late march</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161748" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_415"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_414" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161748</span></a> -- Deploy ORES early April</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161002" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_417"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_416" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161002</span></a> -- Late march wikilabels deployment</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T163016" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_419"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_418" aria-hidden="true"></span>https://phabricator.wikimedia.org/T163016</span></a> -- Deploy Wikilabels mid-April</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T162888" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_421"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_420" aria-hidden="true"></span>https://phabricator.wikimedia.org/T162888</span></a> -- Add header to Wikilabels that warns of upcoming maintenance.</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T162265" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_423"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_422" aria-hidden="true"></span>https://phabricator.wikimedia.org/T162265</span></a> -- Manage wikilabels for labsdb1004 maintenance</li>
</ol>

<h3 class="remarkup-header">Making ORES better:</h3>

<p>We&#039;ve been working to make ORES easier to extend and more useful.  ORES now reports it&#039;s relevant versions at <a href="https://ores.wikimedia.org/versions[8]" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wikimedia.org/versions[8]</a>. We&#039;ve also reduced the complexity of our &quot;precaching&quot; system that scores edits before you ask for them[9,10].  We&#039;re taking  advantage of logstash to store and query our logs[11].  We&#039;ve also implemented some nice abstractions for requests and responses in ORES[12] that allowed us to improve our metrics tracking substantially[13].</p>

<ol class="remarkup-list" start="8">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T155814" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_425"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_424" aria-hidden="true"></span>https://phabricator.wikimedia.org/T155814</span></a> -- Expose version of the service and its dependencies</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T148714" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_427"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_426" aria-hidden="true"></span>https://phabricator.wikimedia.org/T148714</span></a> -- Create generalized &quot;precache&quot; endpoint for ORES</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T162627" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_429"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_428" aria-hidden="true"></span>https://phabricator.wikimedia.org/T162627</span></a> -- Switch <tt class="remarkup-monospaced">/precache</tt> to be a POST end point</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T149010" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_431"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_430" aria-hidden="true"></span>https://phabricator.wikimedia.org/T149010</span></a> -- Send ORES logs to logstash</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T159502" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_433"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_432" aria-hidden="true"></span>https://phabricator.wikimedia.org/T159502</span></a> -- Exclude precaching requests from cache_miss/cache_hit metrics</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161526" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_435"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_434" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161526</span></a> -- Implement ScoreRequest/ScoreResponse pattern in ORES</li>
</ol>

<h3 class="remarkup-header">New functionality:</h3>

<p>In the last month and a half, we&#039;ve added basic support to Korean Wikipedia[14,15]. Props to Revi for helping us work through a bunch of issues with our Korean language support[16,17,18].</p>

<p>We&#039;ve also gotten the ORES Review tool deployed to Hebrew Wikipedia[19,20,21,22] and Estonian Wikipedia[23,24,25].  We&#039;re also working with the Collaboration team to implement the threshold test statistics that they need to tune their new Edit Review interface[26] and we&#039;re working towards making this kind of work self-serve so that that product team and other tool developers won&#039;t have to wait on us to implement these threshold stats in the future[27].</p>

<ol class="remarkup-list" start="14">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161617" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_437"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_436" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161617</span></a> -- Deploy reverted model for kowiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161616" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_439"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_438" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161616</span></a> -- Train/test reverted model for kowiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160752" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_441"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_440" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160752</span></a> -- Korean generated word lists are in chinese</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160757" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_443"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_442" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160757</span></a> -- Add language support for Korean</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160755" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_445"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_444" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160755</span></a> -- Fix tokenization for Korean</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161621" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_447"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_446" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161621</span></a> -- Deploy ORES Review Tool for hewiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130284" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_449"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_448" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130284</span></a> -- Deploy edit quality models for hewiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160930" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_451"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_450" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160930</span></a> -- Train damaging and goodfaith models for hewiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130263" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_453"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_452" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130263</span></a> -- Complete hewiki edit quality campaign</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T159609" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_455"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_454" aria-hidden="true"></span>https://phabricator.wikimedia.org/T159609</span></a> -- Deploy ORES review tool to etwiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130280" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_457"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_456" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130280</span></a> -- Deploy edit quality models for etwiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T129702" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_459"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_458" aria-hidden="true"></span>https://phabricator.wikimedia.org/T129702</span></a> -- Complete etwiki edit quality campaign</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T162377" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_461"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_460" aria-hidden="true"></span>https://phabricator.wikimedia.org/T162377</span></a> -- Implement additional test_stats in editquality</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T162217" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_463"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_462" aria-hidden="true"></span>https://phabricator.wikimedia.org/T162217</span></a> -- Implement &quot;thresholds&quot;, deprecate &quot;pile of tests_stats&quot;</li>
</ol>

<h3 class="remarkup-header">ORES training / labeling campaigns:</h3>

<p>Thanks to a lot of networking at Wikimedia Conference and some help from Ijon (Asaf Batrov), we&#039;ve found a bunch of new collaborators to help us deploy ORES to new wikis.  As is critcial in this process, we need to deploy labeling campaigns so that Wikipedians can help us train ORES.</p>

<p>We&#039;ve got new editquality labeling campaigns deployed to Albanian[28], Finnish[29], Latvian[30], Korean[31], and Turkish[21] Wikipedias.</p>

<p>We&#039;ve also been working on a new type of model: &quot;Item quality&quot; in Wikidata.  We&#039;ve deployed, labeled, and analyzed a pilot[33], fixed some critical bugs that came up[34,35], and we&#039;ve finally launched a 5k item campaign which is already 17% done[36]!  See <a href="https://www.wikidata.org/wiki/Wikidata:Item_quality_campaign" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.wikidata.org/wiki/Wikidata:Item_quality_campaign</a> if you&#039;d like to help us out.</p>

<ol class="remarkup-list" start="28">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161981" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_465"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_464" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161981</span></a> -- Edit quality campaign for Albanian Wikipedia</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161905" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_467"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_466" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161905</span></a> -- Edit quality campaign for Finnish Wikipedia</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T162032" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_469"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_468" aria-hidden="true"></span>https://phabricator.wikimedia.org/T162032</span></a> -- Edit quality campaign for Latvian Wikipedia</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161622" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_471"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_470" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161622</span></a> -- Deploy editquality campaign in Korean Wikipedia</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161977" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_473"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_472" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161977</span></a> -- Start v2 editquality campaign for trwiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T159570" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_475"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_474" aria-hidden="true"></span>https://phabricator.wikimedia.org/T159570</span></a> -- Deploy the pilot of Wikidata item quality campaign</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160256" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_477"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_476" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160256</span></a> -- Wikidata items render badly in Wikilabels</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T162530" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_479"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_478" aria-hidden="true"></span>https://phabricator.wikimedia.org/T162530</span></a> -- Implement &quot;unwanted pages&quot; filtering strategy for Wikidata</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157493" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_481"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_480" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157493</span></a> -- Deploy Wikidata item quality campaign</li>
</ol>

<h3 class="remarkup-header">Bug fixing:</h3>

<p>As usual, we have a few weird bug that got in our way.  We needed to move to a bigger virtual machine in &quot;Beta Labs&quot; because our models take up a bunch of hard drive space[37].  We found that Wikilabels wasn&#039;t removing expired tasks correctly and that this was making it difficult to finish labeling campaigns[38].  We also had a lot of right-to-left issues when we did an upgrade of OOjs UI[39].  There was an old bug we had with <a href="https://translatewiki.net" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://translatewiki.net</a> in one of our message keys[40].</p>

<ol class="remarkup-list" start="37">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160762" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_483"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_482" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160762</span></a> -- deployment-ores-redis /srv/ redis is too small (500MBytes)</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161521" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_485"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_484" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161521</span></a> -- Wikilabels is not cleaning up expired tasks for Wikidata item quality campaign</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T161533" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_487"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_486" aria-hidden="true"></span>https://phabricator.wikimedia.org/T161533</span></a> -- Fix RTL issues in Wikilabels after OOjs UI upgrade</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T132197" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_489"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_488" aria-hidden="true"></span>https://phabricator.wikimedia.org/T132197</span></a> -- qqq for a wiki-ai message cannot be loaded</li>
</ol>

<p>-<a href="https://phabricator.wikimedia.org/p/Halfak/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_490"><span class="phui-tag-core phui-tag-color-person">@Halfak</span></a> <br />
Principal Research Scientist<br />
Head of the Scoring Platform Team</p>

<p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2017-April/000154.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2017-April/000154.html</a>)</p></div></content></entry><entry><title>Status update (March 16th, 2017)</title><link href="/phame/live/8/post/49/status_update_march_16th_2017/" /><id>https://phabricator.wikimedia.org/phame/post/view/49/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T18:25:01+00:00</published><updated>2017-06-03T18:25:01+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2017-March/000145.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2017-March/000145.html</a>)</p>

<p>Hey folks!</p>

<p>I should really stop calling this a weekly update because it&#039;s getting a bit silly at this point.  :)   But if it were a weekly update, it would cover the weeks of 42 - 46.</p>

<p><strong>Highlights:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">3 new models: Finnish Wikipedia (reverted) and Estonian Wikipedia (damaging &amp; goodfaith)</li>
<li class="remarkup-list-item">We estimated and agreed on funding for ORES servers in the next year with Operations</li>
<li class="remarkup-list-item">We published a paper about vandalism detection in Wikidata and a blog post about the massive effect of some initiatives on coverage of Women Scientists in Wikipedia.</li>
</ul>

<p><strong>New development:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We added recall-based threshold metrics to the new draftquality model which should help tool devs know what which new page creations to highlight for review[1]</li>
<li class="remarkup-list-item">We added optional notices for ORES pages which will help us visually distinguish our experimental install in WMFlabs from the Prod install ( ores.wikimedia.org)[2]</li>
<li class="remarkup-list-item">We added basic language support for Finish (Thanks 4shadoww)[3] and deployed a &#039;reverted&#039; model[4]</li>
<li class="remarkup-list-item">We lead a discussion in Wikidata about &quot;item quality&quot; that resulted in a Wikipedia 1.0 like scale for Wikidata quality[5,6] and designed a Wikilabels form to capture the gist of it[7]</li>
<li class="remarkup-list-item">We enabled the ORES Review Tool on Czech Wikipedia[8]</li>
<li class="remarkup-list-item">We configured ChangeProp to use our new minified JSON output to save bandwidth[9]</li>
<li class="remarkup-list-item">We extended the Estonian language assets (Thanks Cumbril)[10] and deployed the &#039;damaging&#039; and &#039;goodfaith&#039; models[11,12]</li>
<li class="remarkup-list-item">We enabled a testing model for &#039;goodfaith&#039; on the Beta Cluster to make it easier for the Collaboration team to run tests with their new filter interface[13]</li>
<li class="remarkup-list-item">We created a new &quot;precache&quot; endpoint that will allow us to de-duplicate configuration with ChangeProp and handle all routing in ORES locally[14]</li>
</ul>

<p><strong>Resourcing:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We completed a 2 year estimate of ORES resource needs and discussed funding (capital expendature) for ORES in the coming fiscal year[15].  This will allow us to continue to grow ORES both in number of models and in scoring capacity.</li>
</ul>

<p><strong>Communications:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Amir improved the KDD paper based on review feedback[16] and got it published[17]</li>
<li class="remarkup-list-item">We published a blob post about our measurements of WikiProject Women Scientists[18,19] -- &quot;The Keilana Effect&quot;</li>
<li class="remarkup-list-item">Thanks to Cumbril&#039;s work, the Estonian labeling campaing was finished[20]</li>
</ul>

<p><strong>Deployments:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">In early February, we deployed a new set of translations to Wikilabels (specifcally targeting Romanian Wikipedia)[21]</li>
<li class="remarkup-list-item">In mid-February, we deployed some fixes to ORES documentation and response formatting[22]</li>
<li class="remarkup-list-item">In mid-March, we deployed 3 new scoring models and ORES notices[23]</li>
</ul>

<p><strong>Maintenance and robustness:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We fixed a serious issue in the &quot;mwoauth&quot; library that Wikilabels depends on[24]</li>
<li class="remarkup-list-item">We reduced the number of revisions per request that we could receive via api.php[25]</li>
<li class="remarkup-list-item">We investigated a scap issue that broke ORES deployment[26]</li>
<li class="remarkup-list-item">We fixed a minor issue with JSON minification behavior[27] and hard-coding of the location of ORES in the documentation[28]</li>
<li class="remarkup-list-item">We improved performance of ORES filters on MediaWiki[29]</li>
<li class="remarkup-list-item">We improved the language describing ORES behavior on Special:Contributions[30]</li>
<li class="remarkup-list-item">We added a notice to the Wikipages that Dexbot maintains about its behavior[31]</li>
<li class="remarkup-list-item">We added notices to ores.wmflabs.org about it&#039;s experimental nature[32]</li>
<li class="remarkup-list-item">We fixed some issues with testing Finnish language assets[33]</li>
<li class="remarkup-list-item">We fixed some styling issues that resulted from an upgrade of OOJS UI[34]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157454" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_492"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_491" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157454</span></a> -- Add recall based thresholds to draftquality model</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T150962" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_494"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_493" aria-hidden="true"></span>https://phabricator.wikimedia.org/T150962</span></a> -- Add an optional notice to ORES main and ui pages</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T158587" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_496"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_495" aria-hidden="true"></span>https://phabricator.wikimedia.org/T158587</span></a> -- Add language support for Finnish</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160228" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_498"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_497" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160228</span></a> -- Train/test reverted model for fiwiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157489" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_500"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_499" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157489</span></a> -- [Discuss] item quality in Wikidata</li>
<li class="remarkup-list-item"><a href="https://www.wikidata.org/wiki/Wikidata:Item_quality" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.wikidata.org/wiki/Wikidata:Item_quality</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T155828" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_502"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_501" aria-hidden="true"></span>https://phabricator.wikimedia.org/T155828</span></a> -- Design item_quality form for Wikidata</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T151611" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_504"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_503" aria-hidden="true"></span>https://phabricator.wikimedia.org/T151611</span></a> -- Enable ORES Review Tool on Czech Wikipedia</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157693" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_506"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_505" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157693</span></a> -- Use minified JSON format in ChangeProp</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160193" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_508"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_507" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160193</span></a> -- Extend estonian language assets from Wiki page</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T159608" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_510"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_509" aria-hidden="true"></span>https://phabricator.wikimedia.org/T159608</span></a> -- Train/test damaging/goodfaith models for etwiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130280" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_512"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_511" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130280</span></a> -- Deploy edit quality models for etwiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160467" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_514"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_513" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160467</span></a> -- Enable &#039;goodfaith&#039; on testwiki on Beta Cluster</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T148714" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_516"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_515" aria-hidden="true"></span>https://phabricator.wikimedia.org/T148714</span></a> -- Create generalized &quot;precache&quot; endpoint for ORES</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157222" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_518"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_517" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157222</span></a> -- Estimate ORES capex for FY2017-18</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T148443" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_520"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_519" aria-hidden="true"></span>https://phabricator.wikimedia.org/T148443</span></a> -- Improve the KDD paper based on the review</li>
<li class="remarkup-list-item"><a href="https://arxiv.org/abs/1703.03861" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://arxiv.org/abs/1703.03861</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160078" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_522"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_521" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160078</span></a> -- Blog post about wp10 measurements of Women Scientists</li>
<li class="remarkup-list-item"><a href="https://blog.wikimedia.org/2017/03/07/the-keilana-effect/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://blog.wikimedia.org/2017/03/07/the-keilana-effect/</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T129702" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_524"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_523" aria-hidden="true"></span>https://phabricator.wikimedia.org/T129702</span></a> -- Complete etwiki edit quality campaign</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157580" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_526"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_525" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157580</span></a> -- Deploy Romanian translations for Wiki labels</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157842" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_528"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_527" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157842</span></a> -- Prod deployment of ORES</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160279" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_530"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_529" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160279</span></a> -- Deploy ores in prod (Mid-March)</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157858" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_532"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_531" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157858</span></a> -- mwoauth is broken</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157983" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_534"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_533" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157983</span></a> -- Reduce the number of revisions that can be requested in one batch</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157623" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_536"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_535" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157623</span></a> -- Investigate failed ORES deployment</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157721" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_538"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_537" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157721</span></a> -- Investigate default JSON minification behavior in production</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T157723" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_540"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_539" aria-hidden="true"></span>https://phabricator.wikimedia.org/T157723</span></a> -- ORES swagger is hard-coded for wmflabs</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T152585" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_542"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_541" aria-hidden="true"></span>https://phabricator.wikimedia.org/T152585</span></a> -- rcshow=oresreview is slow</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T158862" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_544"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_543" aria-hidden="true"></span>https://phabricator.wikimedia.org/T158862</span></a> -- Fix message in Special:Contributions</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T158899" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_546"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_545" aria-hidden="true"></span>https://phabricator.wikimedia.org/T158899</span></a> -- Add notice about Dexbot overwriting manual changes to our tracking table.</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T159055" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_548"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_547" aria-hidden="true"></span>https://phabricator.wikimedia.org/T159055</span></a> -- Add a notice to ores-wmflabs-deploy about &quot;experimental&quot; nature</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160192" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_550"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_549" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160192</span></a> -- Fix testing issues in finnish language assets</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T160258" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_552"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_551" aria-hidden="true"></span>https://phabricator.wikimedia.org/T160258</span></a> -- Fix minor styling issues with OOJS-UI in wikilabels</li>
</ol>

<p>Sincerely,<br />
Aaron from the Scoring Platform team</p></div></content></entry><entry><title>AI Wishlist initialized and a new Phab Tag (January 31st, 2017)</title><link href="/phame/live/8/post/48/ai_wishlist_initialized_and_a_new_phab_tag_january_31st_2017/" /><id>https://phabricator.wikimedia.org/phame/post/view/48/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T18:21:43+00:00</published><updated>2017-07-21T16:56:47+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>I hosted the AI Wishlist session at the Developer Summit(<a href="https://phabricator.wikimedia.org/T147710" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_553"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T147710</span></span></a>).  At that session, we brainstormed a set of AIs that we think would be interesting to implement.  Generally I asked people to do their best to follow template that would help us remember why the AI was important, what it would help with, and what resources might help get it implemented. See <a href="/tag/artificial-intelligence/" class="phui-tag-view phui-tag-type-shade phui-tag-yellow phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_555"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-tags" data-meta="0_554" aria-hidden="true"></span>artificial-intelligence</span></a></p>

<p>Well, I&#039;ve taken all of the notes and filed a large set of phab tasks under a new &quot;artificial-intelligence&quot; tag.  Please review all of the fun, new proposals that are listed there and make sure you subscribe to those that you&#039;re interested in.</p>

<p>-<a href="https://phabricator.wikimedia.org/p/Halfak/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_556"><span class="phui-tag-core phui-tag-color-person">@Halfak</span></a></p>

<p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2017-January/000134.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2017-January/000134.html</a>)</p></div></content></entry><entry><title>New version of Wiki labels (January 8th, 2017)</title><link href="/phame/live/8/post/47/new_version_of_wiki_labels_january_8th_2017/" /><id>https://phabricator.wikimedia.org/phame/post/view/47/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T18:20:13+00:00</published><updated>2017-06-03T18:20:13+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2017-January/000130.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2017-January/000130.html</a>)</p>

<p>Hello,</p>

<p>Wikilabels [1] is the system to label edits for ORES. Until now, users would have to visit a page in Wikipedia, for example WP:Labels [2] and install a gadget and then label edits for ORES. With the new version (0.4.0) deployed today, you can directly go to Wikilabels home page, for example <a href="https://labels.wmflabs.org/ui/enwiki" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://labels.wmflabs.org/ui/enwiki</a> and label edits from there. If you installed the gadget, you can remove it now. We also provided some sort of minification and bundling to improve its performance.</p>

<p>Labeling edits would help ORES work more accurately and in case ORES review tool is not enabled in your wiki, you can provide these data for us using wikilabels so can enable it for your wiki as well!</p>

<p>[1] <a href="https://meta.wikimedia.org/wiki/Wiki_labels" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Wiki_labels</a><br />
[2] <a href="https://en.wikipedia.org/wiki/Wikipedia:Labels" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.org/wiki/Wikipedia:Labels</a></p>

<h3 class="remarkup-header">Best</h3>

<p>Amir Sarabadani Tafreshi (<a href="https://phabricator.wikimedia.org/p/Ladsgroup/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_557"><span class="phui-tag-core phui-tag-color-person">@Ladsgroup</span></a>)<br />
Software Engineer (contractor)</p></div></content></entry><entry><title>Status update (November 29th, 2016)</title><link href="/phame/live/8/post/46/status_update_november_29th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/46/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T18:16:34+00:00</published><updated>2017-06-03T18:17:18+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(The post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-November/000118.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-November/000118.html</a>)</p>

<p>Hey,</p>

<p>This is the 30th and 31st weekly update from the revision scoring team that<br />
we have sent to this mailing list.  We accidentally skipped a week again.</p>

<p><strong>New development:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We added a new &quot;lowest&quot; sensitivity level to ORES review tool.  This new sensistivity level will only flag edits that ORES is very confident are actually damaging[1].</li>
<li class="remarkup-list-item">We applied the MediaWiki standard color palette to Wikilabels[2]</li>
<li class="remarkup-list-item">We generated a manually censored public dataset of spam/vandalism/attack pages[3].  This will help others to develop spam, vandalism and attack page detection models.  See the publication of the dataset[4].</li>
<li class="remarkup-list-item">We&#039;ve implement color-based confidence reporting for ORES damage detection[5]</li>
</ul>

<p><strong>Maintenance and robustness:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We updated the version of OOjs-UI that gets bundled with Wiki labels[6] and moved the static assets to a new repositiory[7]</li>
<li class="remarkup-list-item">We fixed an issue in the recscoring library[8] that caused ORES to return invalid JSON and rendered the UI useless[9].</li>
</ul>

<p><strong>Communications:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We gave a 3 minute presentation on the state of ORES to Victoria Coleman, the WMF&#039;s new CTO[10].</li>
<li class="remarkup-list-item">We performed a basic analysis of Wikipedia article quality trends using the dataset we released a few weeks ago[11].  We&#039;ll have a more substantial analysis soon.</li>
<li class="remarkup-list-item">We made a post on the ORES review tool talk page[12,13] detailing how we plan to incorporate a new filtering strategy into the ORES review tool. Please join the discussion there.</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T150224" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_559"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_558" aria-hidden="true"></span>https://phabricator.wikimedia.org/T150224</span></a> -- Add &quot;Lowest&quot; ORES sensitivity for fpr=0.1</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T151119" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_561"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_560" aria-hidden="true"></span>https://phabricator.wikimedia.org/T151119</span></a> -- Apply ui standardization color palette to Wikilabels</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T150307" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_563"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_562" aria-hidden="true"></span>https://phabricator.wikimedia.org/T150307</span></a> -- Create manually vetted dataset of spam/vandalism/attack pages</li>
<li class="remarkup-list-item"><a href="https://dx.doi.org/10.6084/m9.figshare.4245035" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://dx.doi.org/10.6084/m9.figshare.4245035</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144922" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_565"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_564" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144922</span></a> -- Visually report damaging confidence</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T151222" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_567"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_566" aria-hidden="true"></span>https://phabricator.wikimedia.org/T151222</span></a> -- Update bundled OOJS-ui with Wikilabels</li>
<li class="remarkup-list-item"><a href="https://github.com/wiki-ai/flask-oojsui" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://github.com/wiki-ai/flask-oojsui</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T150961" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_569"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_568" aria-hidden="true"></span>https://phabricator.wikimedia.org/T150961</span></a> -- ORES ui is broken (text field disabled)</li>
<li class="remarkup-list-item"><a href="https://github.com/wiki-ai/ores/issues/177" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://github.com/wiki-ai/ores/issues/177</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T150544" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_571"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_570" aria-hidden="true"></span>https://phabricator.wikimedia.org/T150544</span></a> -- ORES (a 2-3 minute presentation)</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T151214" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_573"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_572" aria-hidden="true"></span>https://phabricator.wikimedia.org/T151214</span></a> -- Basic analysis of Wikipedia quality using monthly predictions</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T150858" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_575"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_574" aria-hidden="true"></span>https://phabricator.wikimedia.org/T150858</span></a> -- Post about ORES review tool including ERI filters</li>
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/Topic:Tflhjj5x1numzg67" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/Topic:Tflhjj5x1numzg67</a></li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p></div></content></entry><entry><title>Changes coming to ORES review tool (November 26th, 2016)</title><link href="/phame/live/8/post/45/changes_coming_to_ores_review_tool_november_26th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/45/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:54:00+00:00</published><updated>2022-07-14T01:24:22+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-November/000117.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-November/000117.html</a>)</p>

<p>Hey,</p>

<p>With merge of 320328 [1] and 320341 [2], two major changes will come to ORES<br />
review tool:<br />
1- You will see one more option in ORES sensitivity called &quot;Lowest&quot;. It<br />
means if you choose it, it only flags edit that are very likely to be<br />
vandalism.<br />
2- Coloring of rows will be completely different. You will see several<br />
colors instead of one and as confidence of ORES grows, the colors will tend<br />
to be more noticeable. It goes without saying that you can change these<br />
colors in your own css. I put a screenshot in [3] and you can test it in<br />
<a href="https://en.wikipedia.beta.wmflabs.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.beta.wmflabs.org</a> or <a href="https://mw-revscoring.wmflabs.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://mw-revscoring.wmflabs.org</a></p>

<p>Feedback is always welcome</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://gerrit.wikimedia.org/r/#/c/320328/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://gerrit.wikimedia.org/r/#/c/320328/</a></li>
<li class="remarkup-list-item"><a href="https://gerrit.wikimedia.org/r/#/c/320341/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://gerrit.wikimedia.org/r/#/c/320341/</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144922#2824696" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_577"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_576" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144922#2824696</span></a></li>
</ol>

<h3 class="remarkup-header">Best</h3>

<p>Amir Sarabadani Tafreshi (<a href="https://phabricator.wikimedia.org/p/Ladsgroup/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_578"><span class="phui-tag-core phui-tag-color-person">@Ladsgroup</span></a>)<br />
Software Engineer (contractor)</p></div></content></entry><entry><title>Awesome AI topics in need of discussion (Dev Summit) (November 18th, 2016)</title><link href="/phame/live/8/post/44/awesome_ai_topics_in_need_of_discussion_dev_summit_november_18th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/44/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:52:12+00:00</published><updated>2017-06-03T17:52:12+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-November/000116.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-November/000116.html</a>)</p>

<p>Hey folks,</p>

<p>I&#039;m your friendly facilitator for who forgot that today was the last day to gather discussion on a set of topics of the Dev Summit.  I might be a bit biased, but I think they are all pretty interesting, so I&#039;m reaching out with a quick overview to see if I can spur some interest from ya&#039;ll.  Check &#039;em out:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T149373" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_580"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_579" aria-hidden="true"></span>https://phabricator.wikimedia.org/T149373</span></a> -- Evaluating the user experience of AI systems</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T147710" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_582"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_581" aria-hidden="true"></span>https://phabricator.wikimedia.org/T147710</span></a> -- Building an AI wishlist &amp; working groups for Wikimedia Projects</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T148690" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_584"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_583" aria-hidden="true"></span>https://phabricator.wikimedia.org/T148690</span></a> -- Where to surface AI in Wikimedia Projects</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T147929" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_586"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_585" aria-hidden="true"></span>https://phabricator.wikimedia.org/T147929</span></a> -- Algorithmic dangers and transparency -- Best practices</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T149666" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_588"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_587" aria-hidden="true"></span>https://phabricator.wikimedia.org/T149666</span></a> -- Next steps for machine translation</li>
</ul>

<p>If you&#039;re interested, please drop a note or a token in the task.  BTW, you don&#039;t have to physically attend the dev summit in order to participate. I&#039;ll make sure that IRC and Etherpad are shared with all remote attendees who want to attend the sessions I&#039;m helping to organize.  I&#039;ve heard that there will be additional facilities for remote attendees (maybe a youtube stream!?) this year, but I can&#039;t confirm yet.</p>

<p>-Aaron</p></div></content></entry><entry><title>Including new filter interface in ORES review tool (November 18th, 2016)</title><link href="/phame/live/8/post/43/including_new_filter_interface_in_ores_review_tool_november_18th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/43/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:50:22+00:00</published><updated>2017-06-03T17:50:22+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-November/000115.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-November/000115.html</a>)</p>

<p>Hey folks,</p>

<p>I made a post at <a href="https://www.mediawiki.org/wiki/Topic:Tflhjj5x1numzg67" class="remarkup-link remarkup-link-ext" rel="noreferrer">mw:Topic:Tflhjj5x1numzg67</a> about including the<br />
new advanced filtering interface that the Collaboration Team is working on<br />
in the ORES beta feature. See the original post and add any discussion<br />
there.</p>

<p>Here&#039;s a demo of what the new filtering interface will look like:<br />
<a href="https://www.mediawiki.org/wiki/File:New-feature_demo%E2%80%94smart_Recent_Changes_filtering_with_ORES.webm" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/File:New-feature_demo%E2%80%94smart_Recent_Changes_filtering_with_ORES.webm</a></p>

<p>-Aaron</p></div></content></entry><entry><title>Status update (November 10th, 2016)</title><link href="/phame/live/8/post/42/status_update_november_10th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/42/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:48:39+00:00</published><updated>2017-06-03T17:48:39+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-November/000114.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-November/000114.html</a>)</p>

<p>Hey,</p>

<p>This is the 29th weekly update from revision scoring team that we have sent<br />
to this mailing list.</p>

<p>Deployments:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We deployed logging changes to ORES that will reduce the verbosity[1]</li>
<li class="remarkup-list-item">We also deployed revscoring 1.3.0 and new models built with it to WMF labs[2].  This won&#039;t change anything important from a user-perspective, but it paves the way for developing new modeling strategies.</li>
</ul>

<p>Maintenance and robustness:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We fixed puppet so that log file directories are also created on the celery worker nodes (affects wmflabs)[3]</li>
<li class="remarkup-list-item">We fixed an issue with our recall_at_fpr metrics which was incorrectly defined and implemented a recall_at_precision metric to take its place[4]</li>
</ul>

<p>New development:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We&#039;ve made a lot of progress on modeling sentences and have just started experimenting with a sentence model from featured articles[5]</li>
<li class="remarkup-list-item">We&#039;re reviewing a dataset of spam/vandalism/attack new page creations for public release[6].  This dataset will help our collaborators work with us on modeling the quality of drafts and supporting new page triage.</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T149730" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_590"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_589" aria-hidden="true"></span>https://phabricator.wikimedia.org/T149730</span></a> -- Deploy logging changes to ORES</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T150447" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_592"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_591" aria-hidden="true"></span>https://phabricator.wikimedia.org/T150447</span></a> -- Deploy revscoring 1.3.0 and updated editquality and wikiclass to wmflabs</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T149925" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_594"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_593" aria-hidden="true"></span>https://phabricator.wikimedia.org/T149925</span></a> -- /srv/log/ores/ not created on worker nodes</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T149825" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_596"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_595" aria-hidden="true"></span>https://phabricator.wikimedia.org/T149825</span></a> -- Implement recall at precision (and fix FPR metrics)</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T148867" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_598"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_597" aria-hidden="true"></span>https://phabricator.wikimedia.org/T148867</span></a> -- Implement sentences datascources &amp; experiment with normalization.</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T150307" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_600"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_599" aria-hidden="true"></span>https://phabricator.wikimedia.org/T150307</span></a> -- Create manually vetted dataset of spam/vandalism/attack pages</li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p></div></content></entry><entry><title>Status update (November 3rd, 2016)</title><link href="/phame/live/8/post/41/status_update_november_3rd_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/41/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:34:02+00:00</published><updated>2017-06-03T17:34:02+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-November/000113.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-November/000113.html</a>)</p>

<p>Hey,<br />
This is 28th weekly update from revision scoring team that we have sent to<br />
this mailing list.</p>

<p>Publications:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">*New dataset shows fifteen years of Wikipedia’s quality trends *posted in Wikimedia Blog [1,2]</li>
<li class="remarkup-list-item">Halfaker, Aaron (2016): Monthly Wikipedia article quality predictions. <em>figshare</em>. <a href="https://doi.org/10.6084/m9.figshare.3859800" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://doi.org/10.6084/m9.figshare.3859800</a> [3,4]</li>
</ul>

<p>Maintenance and robustness:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Now, most of ORES extension source code is covered by continuous integration tests [5]</li>
<li class="remarkup-list-item">In order to keep track of changes in ORES grafana dashboards, we keep their JSON content in github now [6]</li>
<li class="remarkup-list-item">Implemented new metric for grafana: datasources_extracted [7]</li>
</ul>

<p>New development</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Thanks to the reading team, Now ORES extension has API modules to expose ORES scores, [8] filter on recent changes and watchlist, [9] and exposing ores models data [10]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://blog.wikimedia.org/2016/10/27/wikipedia-quality-trends-dataset/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://blog.wikimedia.org/2016/10/27/wikipedia-quality-trends-dataset/</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146709" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_602"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_601" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146709</span></a></li>
<li class="remarkup-list-item"><a href="https://dx.doi.org/10.6084/m9.figshare.3859800" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://dx.doi.org/10.6084/m9.figshare.3859800</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T145332" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_604"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_603" aria-hidden="true"></span>https://phabricator.wikimedia.org/T145332</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146560" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_606"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_605" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146560</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T149347" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_608"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_607" aria-hidden="true"></span>https://phabricator.wikimedia.org/T149347</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T149199" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_610"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_609" aria-hidden="true"></span>https://phabricator.wikimedia.org/T149199</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143614" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_612"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_611" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143614</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143616" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_614"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_613" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143616</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143617" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_616"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_615" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143617</span></a></li>
</ol>

<p>Sincerely,<br />
<a href="https://phabricator.wikimedia.org/p/Ladsgroup/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_617"><span class="phui-tag-core phui-tag-color-person">@Ladsgroup</span></a> from the Revision Scoring team</p></div></content></entry><entry><title>ORES scores is injected to js configs (October 29th, 2016)</title><link href="/phame/live/8/post/40/ores_scores_is_injected_to_js_configs_october_29th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/40/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:18:26+00:00</published><updated>2017-06-03T17:19:08+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-October/000112.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-October/000112.html</a>)</p>

<p>Hey,</p>

<p>With deployment of I42300f9 [1] in wmf.23 which happened several days ago (depending on your wiki), ORES scores are injected to mediawiki javascript config variables. You can access ores data in Special:RecentChanges/Watchlist/Contributions using mw.config.get(&#039;oresData&#039;). It opens up a whole new level of functionality for gadgets. For example, I re-wrote a huge script called ScoredRevisions [2] into several lines [3]. Also, without needing to connect to ores.wikimedia.org, it&#039;s much faster than the original gadget. You can also write scripts to sort rows in recent changes based on their ORES scores, etc. As a fun task I made my recent changes look like a rainbow :D [4] [5]</p>

<p>The next level is to inject ORES thresholds as mediawiki config variables so we can write up wiki-agnostic gadgets.</p>

<p>I would really appreciate comments or ideas :)</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://gerrit.wikimedia.org/r/#/c/314449/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://gerrit.wikimedia.org/r/#/c/314449/</a></li>
<li class="remarkup-list-item"><a href="https://github.com/he7d3r/mw-gadget-ScoredRevisions/blob/master/src/ScoredRevisions.js" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://github.com/he7d3r/mw-gadget-ScoredRevisions/blob/master/src/ScoredRevisions.js</a></li>
<li class="remarkup-list-item"><a href="https://gist.github.com/Ladsgroup/e67e40500b64dd99dc7ab5c2fa34f261" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://gist.github.com/Ladsgroup/e67e40500b64dd99dc7ab5c2fa34f261</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144922#2736504" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_619"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_618" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144922#2736504</span></a></li>
<li class="remarkup-list-item"><a href="https://phab.wmfusercontent.org/file/data/hoibxop7mn4s2cooz4lz/PHID-FILE-w7bg5zw6a323zug7e6mj/pasted_file" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://phab.wmfusercontent.org/file/data/hoibxop7mn4s2cooz4lz/PHID-FILE-w7bg5zw6a323zug7e6mj/pasted_file</a></li>
</ol>

<p>Best,<br />
<a href="https://phabricator.wikimedia.org/p/Ladsgroup/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_620"><span class="phui-tag-core phui-tag-color-person">@Ladsgroup</span></a></p></div></content></entry><entry><title>Status update (October 24th, 2016)</title><link href="/phame/live/8/post/39/status_update_october_24th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/39/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:16:40+00:00</published><updated>2017-06-03T17:16:40+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-October/000111.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-October/000111.html</a>)</p>

<p>Hey,</p>

<p>This is the 26th and 27th weekly update from revision scoring team that we<br />
have sent to this mailing list. We forgot to send the update for last week!</p>

<p>Last week, we were featured in Research&#039;s quarterly review. In the last 3<br />
months, we achieved our goals to expand the ORES extension to 6 wikis (we<br />
made it to 8!) and to release datasets of article quality predictions. The<br />
minutes from the quarterly review are not yet online, but once they are,<br />
you&#039;ll be able to see them at [1].</p>

<p>Maintenance and robustness:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We discussed and decided on a set of strategies for handling goodfaith/naive DOS attacks on ORES[2]</li>
<li class="remarkup-list-item">We fixed an i18n issue in Wiki Labels[3]</li>
<li class="remarkup-list-item">We updated the article quality models (wikiclass/wp10) to use revscoring 1.3.0[4]</li>
<li class="remarkup-list-item">We investigated and solved a memory leak in our pre-caching utility[5]</li>
<li class="remarkup-list-item">We configured celery to send its logs to a place where we can read them for easier debugging[6]</li>
<li class="remarkup-list-item">We deployed a set of schema changes to constrain the ORES Review Tools database appropriately[7]</li>
<li class="remarkup-list-item">Also worth noting is that the services cluster (SCB) has been expanded[8].  ORES has now doubled in capacity</li>
</ul>

<p>Datasets</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We discussed how to make the historical article quality dataset available via quarry[8]. Regretfully, it seems that we&#039;ll not be able to do that for at least a couple of months.</li>
</ul>

<p>New development</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We&#039;ve implemented embedding of machine-readable scores in a JS variable on-wiki[9]. This will make it easier for tool developers to experiment with new ways of displaying Special:RecentChanges more easily. It&#039;s also a necessary precondition for adding color-based signaling of ORES&#039; confidence about an edit.</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://meta.wikimedia.org/wiki/Wikimedia_Foundation_metrics_and_activities_meetings/Quarterly_reviews/Research,_Design_Research,_Analytics,_and_Performance,_October_2016" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Wikimedia_Foundation_metrics_and_activities_meetings/Quarterly_reviews/Research,_Design_Research,_Analytics,_and_Performance,_October_2016</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T148347" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_622"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_621" aria-hidden="true"></span>https://phabricator.wikimedia.org/T148347</span></a> -- [Discuss] DOS attacks on ORES. What to do?</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T139587" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_624"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_623" aria-hidden="true"></span>https://phabricator.wikimedia.org/T139587</span></a> -- Revision not found error unformatted and not localized</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T147201" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_626"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_625" aria-hidden="true"></span>https://phabricator.wikimedia.org/T147201</span></a> -- Update wikiclass for revscoring 1.3.0</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146500" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_628"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_627" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146500</span></a> -- Investigate memory leak in precached</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T147898" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_630"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_629" aria-hidden="true"></span>https://phabricator.wikimedia.org/T147898</span></a> -- Send celery logs to /srv/log/ores instead of /var/lib/daemon.log</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T147734" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_632"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_631" aria-hidden="true"></span>https://phabricator.wikimedia.org/T147734</span></a> -- Review and deploy 309825</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T147903" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_634"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_633" aria-hidden="true"></span>https://phabricator.wikimedia.org/T147903</span></a> -- Expand SCB cluster</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146718" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_636"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_635" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146718</span></a> -- [Discuss] Hosting the monthly article quality dataset on labsDB</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143611" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_638"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_637" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143611</span></a> -- Embed machine readable ores scores as data on pages where ORES scores things</li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p></div></content></entry><entry><title>Status update (October 11th, 2016)</title><link href="/phame/live/8/post/38/status_update_october_11th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/38/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:14:53+00:00</published><updated>2017-06-03T17:14:53+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-October/000106.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-October/000106.html</a>)</p>

<p>Hey,</p>

<p>This is the 24th and 25th weekly update from revision scoring team that we<br />
have sent to this mailing list. We skipped a week due to travel and other<br />
work.</p>

<p>Maintenance and robustness:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We improved the performance of RecentChanges fitlering in the ORES extension[1]</li>
<li class="remarkup-list-item">We built and ran a maintenance script to clean up duplicate cached data for the ORES extension[2,3]</li>
<li class="remarkup-list-item">We updated the editquality models for the new version of revscoring (1.3.0)[4] and made some upstream changes to json2tsv to make that easier[5]</li>
<li class="remarkup-list-item">We quited down some of our error reporting so that our logs take up less space[6]</li>
</ul>

<p>Datasets:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We generated a dataset that uses the &quot;wp10&quot; prediction model to assess article quality in monthly intervals for English, French, and Russian Wikipedia[7].  This should enable new research into the quality dynamics of these wikis.</li>
<li class="remarkup-list-item">We generated a dataset of vandalism, spam, and attack page creations for building a new &quot;draft quality&quot; model[8]</li>
</ul>

<p>Communication:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Presented about transparent/open AI development practices around ORES at the Association of Internet Researchers[9]</li>
</ul>

<p>New development:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We&#039;ve made substantial progress towards adding ORES data to MediaWiki&#039;s api.php endpoints with rcshow=oresreview[10] and rvprop=ores[11]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146111" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_640"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_639" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146111</span></a> -- hidenondamaging=1 query is extremely slow on enwiki</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T145356" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_642"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_641" aria-hidden="true"></span>https://phabricator.wikimedia.org/T145356</span></a> -- Ensure ORES data violating constraints do not affect production</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T145503" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_644"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_643" aria-hidden="true"></span>https://phabricator.wikimedia.org/T145503</span></a> -- Build a maintenance script to clean up duplicate data</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146410" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_646"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_645" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146410</span></a> -- Update editquality for revscoring 1.3.0</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146939" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_648"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_647" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146939</span></a> -- Add type decoding support to tsv2json</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146680" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_650"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_649" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146680</span></a> -- Quiet result.get Warning in tasks</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T145655" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_652"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_651" aria-hidden="true"></span>https://phabricator.wikimedia.org/T145655</span></a> -- Generate monthly article quality dataset</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T135644" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_654"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_653" aria-hidden="true"></span>https://phabricator.wikimedia.org/T135644</span></a> -- Generate spam and vandalism new page creation dataset</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T147706" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_656"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_655" aria-hidden="true"></span>https://phabricator.wikimedia.org/T147706</span></a> -- Present about ORES transparency at AoIR</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143616" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_658"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_657" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143616</span></a> -- Introduce rcshow=oresreview and similar ones</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143614" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_660"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_659" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143614</span></a> -- Introduce ORES rvprop</li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p></div></content></entry><entry><title>Status update (September 28th, 2016)</title><link href="/phame/live/8/post/37/status_update_september_28th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/37/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:12:19+00:00</published><updated>2017-06-06T05:41:51+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-September/000102.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-September/000102.html</a>)</p>

<p>Hey,</p>

<p>This is the 23rd weekly update from revision scoring team that we have sent<br />
to this mailing list.</p>

<p>New development</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We implemented and demonstrated a linguistic/stylometric processing strategy that should give us more signal for finding vandalism and spam[1].  See the discussion on the AI list[2].</li>
<li class="remarkup-list-item">As part of our support for the Collaboration Team, we&#039;ve been producing tables of model statistics that correspond to set of thresholds[3].  This helps their designers work on strategies for reporting prediction confidence in an intuitive way.</li>
</ul>

<p>Maintenance and robustness</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We had a major downtime event that was caused by our logs being too verbose.  We&#039;ve recovered and turned down the log level[4].</li>
<li class="remarkup-list-item">We made sure that halfak got pings when ores.wikimedia.org goes down[5]</li>
</ul>

<p>Datasets</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We created a database on Wikimedia Labs that provides access to a dataset containing a complete set of article quality predictions for English Wikipedia[6].  See our announcements[7,8,9].</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146335" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_662"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_661" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146335</span></a> -- Implement a basic scoring strategy for PCFGs</li>
<li class="remarkup-list-item"><a href="https://lists.wikimedia.org/pipermail/ai/2016-September/000098.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-September/000098.html</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146280" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_664"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_663" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146280</span></a> -- Produce tables of stats for damaging and goodfaith models</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146581" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_666"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_665" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146581</span></a> -- celery log level is INFO causing disruption on ORES service</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146720" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_668"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_667" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146720</span></a> -- Ensure that halfak gets emails when  ores.wikimedia.org goes down</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T106278" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_670"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_669" aria-hidden="true"></span>https://phabricator.wikimedia.org/T106278</span></a> -- Setup a db on labsdb for article quality that is publicly accessible</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T146156" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_672"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_671" aria-hidden="true"></span>https://phabricator.wikimedia.org/T146156</span></a> -- Announce article quality database in labsdb</li>
<li class="remarkup-list-item"><a href="https://lists.wikimedia.org/pipermail/ai/2016-September/000091.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-September/000091.html</a></li>
<li class="remarkup-list-item"><a href="https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(technical)/Archive_149#ORES_article_quality_data_as_a_database_table" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(technical)/Archive_149#ORES_article_quality_data_as_a_database_table</a></li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p></div></content></entry><entry><title>NLP (PCFG) work (September 28th, 2016)</title><link href="/phame/live/8/post/36/nlp_pcfg_work_september_28th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/36/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:05:55+00:00</published><updated>2017-06-03T17:20:15+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-September/000098.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-September/000098.html</a>)</p>

<p>I&#039;ve been looking at some recent work that used Probabilistic Context-free Grammars[1,2] to detect vandalism in Wikipedia.  I wanted to send a quick message to share some progress.</p>

<p>I&#039;ve built a python library that implements a really simple PCFG training and scoring strategy and written a quick demo of how it can work.  In the following demo, I show how we can build a probabilistic grammar using the I&#039;m a Little Teapot song[4].  Note how sentences that are not characteristic of the song score lower.  Note that scores are log-scaled.</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">&gt;&gt;&gt; sentences = [
...              &quot;I am a little teapot&quot;,
...              &quot;Here is my handle&quot;,
...              &quot;Here is my spout&quot;,
...              &quot;When I get all steamed up I just shout tip me over and pour me out&quot;,
...              &quot;I am a very special pot&quot;,
...              &quot;It is true&quot;,
...              &quot;Here is an example of what I can do&quot;,
...              &quot;I can turn my handle into a spout&quot;,
...              &quot;Tip me over and pour me out&quot;]
&gt;&gt;&gt;
&gt;&gt;&gt;
&gt;&gt;&gt; teapot_grammar = TreeScorer.from_tree_bank(bllip_parse(s) for s in sentences)
&gt;&gt;&gt;
&gt;&gt;&gt; teapot_grammar.score(bllip_parse(&quot;Here is a little teapot&quot;))
-9.392661928770137
&gt;&gt;&gt; teapot_grammar.score(bllip_parse(&quot;It is my handle&quot;))
-10.296301543090733
&gt;&gt;&gt; teapot_grammar.score(bllip_parse(&quot;I am a spout&quot;))
-10.40166205874856
&gt;&gt;&gt; teapot_grammar.score(bllip_parse(&quot;Your teapot is gay&quot;))
-12.96352974967269
&gt;&gt;&gt; teapot_grammar.score(bllip_parse(&quot;Your mom&#039;s teapot is asldasnldansldal&quot;))
-19.424997926026403</pre></div>

<p>This work is inspired by work that Arthur Tilley (<a href="https://phabricator.wikimedia.org/p/aetilley/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_673"><span class="phui-tag-core phui-tag-color-person">@aetilley</span></a>) did on our team a last year[5].  The &#039;kasami&#039; library represents a narrow slice of Arthur&#039;s work.</p>

<p>Next, I&#039;m working on building out revscoring to implement some features<br />
that use the scoring strategy on sentenced modified in an edit.  I&#039;m hoping<br />
that this type of feature engineering will allow us to catch edits that<br />
make articles more/less notable.  I&#039;m also targeting spammy language and<br />
insults.</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://en.wikipedia.org/wiki/Stochastic_context-free_grammar" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.org/wiki/Stochastic_context-free_grammar</a></li>
<li class="remarkup-list-item"><a href="http://pub.cs.sunysb.edu/~rob/papers/acl11_vandal.pdf" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://pub.cs.sunysb.edu/~rob/papers/acl11_vandal.pdf</a></li>
<li class="remarkup-list-item"><a href="https://github.com/halfak/kasami" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://github.com/halfak/kasami</a></li>
<li class="remarkup-list-item"><a href="https://en.wikipedia.org/wiki/I%27m_a_Little_Teapot" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.org/wiki/I%27m_a_Little_Teapot</a></li>
<li class="remarkup-list-item"><a href="https://github.com/aetilley/pcfg" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://github.com/aetilley/pcfg</a></li>
</ol>

<p>-<a href="https://phabricator.wikimedia.org/p/Halfak/" class="phui-tag-view phui-tag-type-person " data-sigil="hovercard" data-meta="0_674"><span class="phui-tag-core phui-tag-color-person">@Halfak</span></a></p></div></content></entry><entry><title>Status update (September 22nd, 2016)</title><link href="/phame/live/8/post/35/status_update_september_22nd_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/35/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:03:24+00:00</published><updated>2017-06-03T17:03:24+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-September/000095.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-September/000095.html</a>)</p>

<p>Hey,</p>

<p>This is the 22nd weekly update from revision scoring team that we have sent to this mailing list.</p>

<p>UI work:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We configured the default threshold for the ORES review tool on Wikidata to be more strict (higher recall, lower precision)[1]</li>
<li class="remarkup-list-item">We fixed a display issue on Special:Contributions where the filters would not wrap[2]</li>
</ul>

<p>Increasing model fitness:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We finished demonstrating model fitness gains using hash-vector features[3].  Next, we&#039;ll be working to get the hash-vector features implemented in revscoring/ORES[4].</li>
<li class="remarkup-list-item">We implemented a new strategy for training and testing on all data using cross-validation[5].  This will both increase the fitness of the models and make the statistics reported more robust.</li>
</ul>

<p>Maintenance and robustness</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We fixed an indexing issues in ores_model that prevented the deployment of updated models[6].</li>
<li class="remarkup-list-item">We did a minor investigation to a short period of degraded service quality on WMF Labs[7]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144784" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_676"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_675" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144784</span></a> -- Change default threshold for Wikidata to high</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143518" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_678"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_677" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143518</span></a> -- Filter on user contribs has nowrap, causing issues</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T128087" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_680"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_679" aria-hidden="true"></span>https://phabricator.wikimedia.org/T128087</span></a> -- [Spike] Investigate HashingVectorizer</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T145812" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_682"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_681" aria-hidden="true"></span>https://phabricator.wikimedia.org/T145812</span></a> -- Implement ~100 most important hash vector features in editquality models</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T142953" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_684"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_683" aria-hidden="true"></span>https://phabricator.wikimedia.org/T142953</span></a> -- Train on all data, Report test statistics on cross-validation</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144432" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_686"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_685" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144432</span></a> -- oresm_model index should not be unique</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T145353" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_688"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_687" aria-hidden="true"></span>https://phabricator.wikimedia.org/T145353</span></a> -- Investigate short period of ores-web-03 insanity</li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p></div></content></entry><entry><title>Status update (September 14th, 2016)</title><link href="/phame/live/8/post/34/status_update_september_14th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/34/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T17:01:56+00:00</published><updated>2017-06-03T17:01:56+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-September/000088.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-September/000088.html</a>)</p>

<p>Hey,</p>

<p>This is the 21st weekly update from revision scoring team that we have sent<br />
to this mailing list.</p>

<p>New development</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We received a request to get moving on Spanish Wikibooks support, so we dug in:</li>
<li class="remarkup-list-item">We deployed a new Wiki labels campaign[1]</li>
<li class="remarkup-list-item">We fixed an issue in Wiki labels that prevented requests from *. wikibooks.org[2]</li>
<li class="remarkup-list-item">We trained a basic &quot;revert&quot; detection model that seems to be pretty effective[3]</li>
<li class="remarkup-list-item">We also generated a dataset of article quality scores for English Wikipedia[4].  You can download it here: [5]</li>
</ul>

<p>This week, we invested in some long term tasks.  If you review our<br />
phabricator board, you&#039;ll see substantial progress in improving our damage<br />
detection models with hashing vectorization strategies[6, 7], implementing<br />
a more robust model testing strategy[8], and implementing some advance<br />
natural language processing strategies[9, 10].  Stay tuned for the<br />
completion of these activities in the coming weeks.</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143962" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_690"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_689" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143962</span></a> -- Add uniqueness constraints to ores_classification</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T145406" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_692"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_691" aria-hidden="true"></span>https://phabricator.wikimedia.org/T145406</span></a> -- Fix CORS for wikibooks</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T145428" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_694"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_693" aria-hidden="true"></span>https://phabricator.wikimedia.org/T145428</span></a> -- Train/test reverted model for Spanish Wikibooks</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T135684" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_696"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_695" aria-hidden="true"></span>https://phabricator.wikimedia.org/T135684</span></a> -- Generate recent article quality scores for English Wikipedia</li>
<li class="remarkup-list-item"><a href="https://datasets.wikimedia.org/public-datasets/enwiki/article_quality/wp10-scores-enwiki-20160820.tsv.bz2" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://datasets.wikimedia.org/public-datasets/enwiki/article_quality/wp10-scores-enwiki-20160820.tsv.bz2</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T128087" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_698"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_697" aria-hidden="true"></span>https://phabricator.wikimedia.org/T128087</span></a> -- [Spike] Investigate HashingVectorizer</li>
<li class="remarkup-list-item"><a href="https://en.wikipedia.org/wiki/Feature_hashing" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.org/wiki/Feature_hashing</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T142953" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_700"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_699" aria-hidden="true"></span>https://phabricator.wikimedia.org/T142953</span></a> -- Train on all data, Report test statistics on cross-validation</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144636" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_702"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_701" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144636</span></a> -- Implement PCFG features</li>
<li class="remarkup-list-item"><a href="https://en.wikipedia.org/wiki/Stochastic_context-free_grammar" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://en.wikipedia.org/wiki/Stochastic_context-free_grammar</a></li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p></div></content></entry><entry><title>Status update (September 6th, 2016)</title><link href="/phame/live/8/post/33/status_update_september_6th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/33/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T16:59:36+00:00</published><updated>2017-06-03T16:59:36+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-September/000087.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-September/000087.html</a>)</p>

<p>Hey,</p>

<p>This is the 20th weekly update from revision scoring team that we have sent<br />
to this mailing list.</p>

<p>New development:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We implemented the basic functionality for handling bag of words and other types of abstract feature vectors in <tt class="remarkup-monospaced">revscoring</tt>. [1] This required some changes to some dependencies as well. [2]</li>
<li class="remarkup-list-item">We extended the user-group related features to include more of the dominant groups outside of English Wikipedia [3] and incremented the models that changed substantially [4]</li>
</ul>

<p>Documentation:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We extended the documentation at mw:Extension:ORES to make it easier for new developers to work with us. [5]</li>
</ul>

<p>Resourcing:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We discussed the teams resourcing needs (hardware, engineering, and community liaison support) with Wes Moran. [6]</li>
</ul>

<p>Maintenance and robustness:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We addressed a variety of issues around caching and how the ORES extension loads new data</li>
<li class="remarkup-list-item">ORES now returns headers that will disable secondary caching. [7]</li>
<li class="remarkup-list-item">Our maintenance scripts will circumvent caches that do not listen to no-cache headers. [8, 9]</li>
<li class="remarkup-list-item">We fixed an issue where the ORES review tool would duplicate items in Special:RecentChanges. [10]</li>
<li class="remarkup-list-item">We standardized the extraction pattern for the enwiktionary model so that it looks similar to other models. [11]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T132580" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_704"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_703" aria-hidden="true"></span>https://phabricator.wikimedia.org/T132580</span></a> -- Implement abstraction for Sparse Feature Vectors</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144430" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_706"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_705" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144430</span></a> -- Update yamlconf so that import_path can handle deep attributes</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T143909" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_708"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_707" aria-hidden="true"></span>https://phabricator.wikimedia.org/T143909</span></a> -- Extend user group features</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144855" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_710"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_709" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144855</span></a> -- Increment ruwiki editquality models</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144676" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_712"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_711" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144676</span></a> -- Improve technical documentation in Extension:ORES in mediawiki.ore</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144517" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_714"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_713" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144517</span></a> -- ORES and Product: resourcing discussion</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144193" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_716"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_715" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144193</span></a> -- Set max-age header to 0 seconds for ORES to quiet secondary caches</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144196" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_718"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_717" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144196</span></a> -- Get model version needs to invalidate cache</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144195" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_720"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_719" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144195</span></a> -- Check model version replaces every time it runs.</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144233" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_722"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_721" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144233</span></a> -- Redundant results in ORES review tool</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T144605" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_724"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_723" aria-hidden="true"></span>https://phabricator.wikimedia.org/T144605</span></a> -- Fix makefile entry for enwiktionary.rev_reverted.20k_2016.tsv</li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p></div></content></entry><entry><title>ORES review tool deployment status (September 3rd, 2016)</title><link href="/phame/live/8/post/32/ores_review_tool_deployment_status_september_3rd_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/32/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T16:57:39+00:00</published><updated>2017-06-03T16:57:39+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-September/000085.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-September/000085.html</a>)</p>

<p>Hey folks,</p>

<p>I recently received an email asking for more information about how to get<br />
the ORES review tool[1] deployed in more wikis (we currently support 8:<br />
wikidata, fawiki, enwiki, nlwiki, ptwiki, plwiki, trwiki, ruwiki).  I<br />
figured that this summary should be shared more broadly, so I&#039;m pasting it<br />
below.</p>

<hr class="remarkup-hr" />

<p>This is the best guide that we have for users requesting support right now:<br />
<a href="https://meta.wikimedia.org/wiki/Objective_Revision_Evaluation_Service/Get_support" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Objective_Revision_Evaluation_Service/Get_support</a></p>

<p>There&#039;s a lot that we need to do in response to requests for support.  We<br />
currently have Wiki labels[2] edit quality campaigns running in</p>

<p><a href="http://labels.wmflabs.org/campaigns/arwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/arwiki/?campaigns=stats</a> (2464/4977 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/azwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/azwiki/?campaigns=stats</a> (0/5000 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/dewiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/dewiki/?campaigns=stats</a> (182/4177 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/enwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/enwiki/?campaigns=stats</a> (350/6333 labels, extension)<br />
<a href="http://labels.wmflabs.org/campaigns/eswiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/eswiki/?campaigns=stats</a> (1210/8434 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/etwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/etwiki/?campaigns=stats</a> (824/4678 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/fawiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/fawiki/?campaigns=stats</a> (1781/3156 labels, extension)<br />
<a href="http://labels.wmflabs.org/campaigns/frwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/frwiki/?campaigns=stats</a> (274/5000 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/hewiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/hewiki/?campaigns=stats</a> (1069/5000 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/huwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/huwiki/?campaigns=stats</a> (518/5000 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/idwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/idwiki/?campaigns=stats</a> (50/2200 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/itwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/itwiki/?campaigns=stats</a> (591/5390 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/jawiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/jawiki/?campaigns=stats</a> (356/9514 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/nowiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/nowiki/?campaigns=stats</a> (1815/5000 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/svwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/svwiki/?campaigns=stats</a> (1657/5000 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/ukwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/ukwiki/?campaigns=stats</a> (161/3318 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/urwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/urwiki/?campaigns=stats</a> (153/5000 labels)<br />
<a href="http://labels.wmflabs.org/campaigns/viwiki/?campaigns=stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://labels.wmflabs.org/campaigns/viwiki/?campaigns=stats</a> (0/5000 labels)</p>

<p>Note that we&#039;re developing a nice dashboard for this.  See <a href="http://tools.wmflabs.org/dexbot/tools/wikilabels_stats.php" class="remarkup-link remarkup-link-ext" rel="noreferrer">http://tools.wmflabs.org/dexbot/tools/wikilabels_stats.php</a></p>

<p>Note that two campaigns are labeled &quot;extension&quot; because we already have<br />
support for those wikis, but we are running campaigns to extend the<br />
observations in our labeled datasets for higher fitness.  In order to get<br />
these campaigns done, we need a local Wikipedian (or liaison) to call<br />
attention to the campaign and make sure that questions get answered and<br />
work continues.  The wikis that already have support are those wikis where<br />
we found a strong local collaborator to help.</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Wikidata - User:Ladsgroup</li>
<li class="remarkup-list-item">Ptwiki - User:He7d3r</li>
<li class="remarkup-list-item">Trwiki - User:WhiteCat</li>
<li class="remarkup-list-item">Enwiki - User:EpochFail</li>
<li class="remarkup-list-item">Fawiki - User:Ladsgroup</li>
<li class="remarkup-list-item">Ruwiki - User:Putnik</li>
<li class="remarkup-list-item">Nlwiki - User:Krinkle</li>
<li class="remarkup-list-item">Plwiki - User:Tar_Lócesilion</li>
</ul>

<p>As of right now, I&#039;m the only person who is officially working on ORES full<br />
time.  Amir (User:Ladsgroup) is funded to work on ORES 4 hours per week<br />
through WMDE.  So, any time that someone asks something from us, we in<br />
turn, ask for support in order to be able to do it.  The most critical<br />
support we could get for moving faster would be (1) community liaison<br />
support for identifying local collaborators and driving the current Wiki<br />
labels campaigns and (2) Engineering support to make the icky<br />
user-script[3] into a proper extension.</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/ORES_review_tool" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/ORES_review_tool</a></li>
<li class="remarkup-list-item"><a href="https://meta.wikimedia.org/wiki/Wiki_labels" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Wiki_labels</a></li>
<li class="remarkup-list-item"><a href="https://meta.wikimedia.org/wiki/Wiki_labels#Installation" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Wiki_labels#Installation</a></li>
</ol>

<p>-Aaron</p></div></content></entry><entry><title>Deployment of ORES review tool in Englis Wikipedia as a beta feature (August 23rd, 2016)</title><link href="/phame/live/8/post/31/deployment_of_ores_review_tool_in_englis_wikipedia_as_a_beta_feature_august_23rd_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/31/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T16:51:45+00:00</published><updated>2017-06-03T16:52:20+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>We <a href="https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service#Team&gt;" class="remarkup-link remarkup-link-ext" rel="noreferrer">The Revision Scoring Team</a><br />
are happy to announce the deployment of the <a href="https://meta.wikimedia.org/wiki/ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">ORES</a> <a href="https://www.mediawiki.org/wiki/ORES_review_tool" class="remarkup-link remarkup-link-ext" rel="noreferrer">review tool</a> as a <a href="https://en.wikipedia.org/wiki/Special:Preferences#mw-prefsection-betafeatures" class="remarkup-link remarkup-link-ext" rel="noreferrer">beta feature</a> on *English Wikipedia*. Once enabled, ORES highlights edits that are likely to be damaging in <a href="https://en.wikipedia.org/wiki/Special:RecentChanges" class="remarkup-link remarkup-link-ext" rel="noreferrer">Special:RecentChanges</a>, <a href="https://en.wikipedia.org/wiki/Special:Watchlist" class="remarkup-link remarkup-link-ext" rel="noreferrer">Special:Watchlist</a> and <a href="https://en.wikipedia.org/wiki/Special:Contributions" class="remarkup-link remarkup-link-ext" rel="noreferrer">Special:Contributions</a> to help you prioritize your patrolling work. ORES detects damaging edits using a basic prediction model based on <a href="https://meta.wikimedia.org/wiki/Research:Automated_classification_of_edit_quality" class="remarkup-link remarkup-link-ext" rel="noreferrer">past damage</a>.</p>

<p>ORES is an experimental technology. We encourage you to take advantage of it but also to be skeptical of the predictions made. It&#039;s a tool to support you – it can&#039;t replace you. Please reach out to us with your questions and concerns.</p>

<p><strong>Documentation:</strong> <a href="https://www.mediawiki.org/wiki/ORES_review_tool" class="remarkup-link remarkup-link-ext" rel="noreferrer">mw:ORES review tool</a>, <a href="https://www.mediawiki.org/wiki/Extension:ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">mw:Extension:ORES</a>, and <a href="https://meta.wikimedia.org/wiki/ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">m:ORES</a></p>

<p><strong>Bugs &amp; feature requests:</strong> #revision-scoring-as-a-service-backlog</p>

<p><strong>IRC:</strong> <a href="http://webchat.freenode.net/?channels=#wikimedia-ai" class="remarkup-link remarkup-link-ext" rel="noreferrer">#wikimedia-ai</a></p>

<p>Sincerely,<br />
Amir from the Revision Scoring team</p></div></content></entry><entry><title>New models coming to ORES &amp; notes (August 19th, 2016)</title><link href="/phame/live/8/post/30/new_models_coming_to_ores_notes_august_19th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/30/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T16:44:33+00:00</published><updated>2017-06-03T16:44:33+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-August/000068.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-August/000068.html</a>)</p>

<p>Hey folks,</p>

<p>We&#039;ve been working on generating some updated models for ORES.  These<br />
models will behave slightly differently from the models that we currently<br />
have deployed.  This is a natural artifact of retraining the models on the<br />
*exact same data* again because of some random properties of the learning<br />
algorithms.  So, for the most part, this should be a non-issue for any<br />
tools that use ORES.  However, I wanted to take this opportunity to<br />
highlight some of the facilities ORES provides to help automatically detect<br />
and adjust for these types of changes.</p>

<h3 class="remarkup-header">Versions</h3>

<p>ORES provides information about all of the models.  This information<br />
includes a model version number.  If you are caching ORES scores locally,<br />
we recommend invalidating old scores whenever this model number changes.<br />
For example, <a href="https://ores.wikimedia.org/v2/scores/enwiki/damaging/12345678" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wikimedia.org/v2/scores/enwiki/damaging/12345678</a><br />
currently returns</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">{
  &quot;scores&quot;: {
    &quot;enwiki&quot;: {
      &quot;damaging&quot;: {
        &quot;scores&quot;: {
          &quot;12345678&quot;: {
            &quot;prediction&quot;: false,
            &quot;probability&quot;: {
              &quot;false&quot;: 0.7141333465390294,
              &quot;true&quot;: 0.28586665346097057
            }
          }
        },
        &quot;version&quot;: &quot;0.1.1&quot;
      }
    }
  }
}</pre></div>

<p>This score was generated with the &quot;0.1.1&quot; version of the model.  But once<br />
we deploy the new models, the same request will return:</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">{
  &quot;scores&quot;: {
    &quot;enwiki&quot;: {
      &quot;damaging&quot;: {
        &quot;scores&quot;: {
          &quot;12345678&quot;: {
            &quot;prediction&quot;: false,
            &quot;probability&quot;: {
              &quot;false&quot;: 0.8204647324045306,
              &quot;true&quot;: 0.17953526759546945
            }
          }
        },
        &quot;version&quot;: &quot;0.1.2&quot;
      }
    }
  }
}</pre></div>

<p>Note that the version number changes to &quot;0.1.2&quot; and the probabilities<br />
change slightly.  In this case, we&#039;re essentially re-training the same<br />
model in a similar way, so we increment the &quot;patch&quot; number.</p>

<p>However, we&#039;re switching modeling strategies for the article quality models<br />
(enwiki-wp10, frwiki-wp10 &amp; ruwiki-wp10), so those versions increment the<br />
minor version from &quot;0.3.2&quot; to &quot;0.4.0&quot;.  You may see more substantial<br />
changes in prediction probabilities with those models, but a quick<br />
spot-checking suggests that the changes are not substantial.</p>

<h3 class="remarkup-header">Test statistics and threshholding</h3>

<p>So, many tools that use our edit quality models (reverted, damaging and<br />
goodfaith) will set threshholds for flagging edits for review.  In order to<br />
support these tools, we produce test statistics that suggest useful<br />
thresholds.</p>

<p><a href="https://ores.wmflabs.org/v2/scores/enwiki/damaging/?model_info=test_stats" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wmflabs.org/v2/scores/enwiki/damaging/?model_info=test_stats</a><br />
produces:</p>

<div class="remarkup-code-block" data-code-lang="text" data-sigil="remarkup-code-block"><pre class="remarkup-code">...
      &quot;filter_rate_at_recall(min_recall=0.75)&quot;: {
        &quot;filter_rate&quot;: 0.869,
        &quot;recall&quot;: 0.752,
        &quot;threshold&quot;: 0.492
      },
      &quot;filter_rate_at_recall(min_recall=0.9)&quot;: {
        &quot;filter_rate&quot;: 0.753,
        &quot;recall&quot;: 0.902,
        &quot;threshold&quot;: 0.173
      },
...</pre></div>

<p>These two statistics show useful thresholds for detecting damaging edits.<br />
E.g. if you want to be sure that you catch nearly all vandalism (and are OK<br />
with a higher false-positive rate), set the threshold at 0.173, but if<br />
you&#039;d like to catch most vandalism with almost no false-positives, set the<br />
threshold at 0.492.  These fields can be read automatically by tools so<br />
that they do not need to be manually updated every time that we deploy a<br />
new model.</p>

<p>Let me know if you have any questions and happy hacking!</p>

<p>-Aaron</p></div></content></entry><entry><title>Status update (August 2nd, 2016)</title><link href="/phame/live/8/post/29/status_update_august_2nd_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/29/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T16:29:15+00:00</published><updated>2017-06-03T17:29:00+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-August/000049.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-August/000049.html</a>)</p>

<p>Hey,</p>

<p>This is the 15th weekly update from revision scoring team that we have sent<br />
to this mailing list.</p>

<p><strong>New developments:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We&#039;ll no longer unnecessarily load the models into memory on the web workers[1].</li>
<li class="remarkup-list-item">We can now score multiple models against the same revision ID for (essentially) free[2].</li>
<li class="remarkup-list-item">Our precaching system will take advantage of this to drop load by about 3X[3].</li>
<li class="remarkup-list-item">Update wmflabs deploy repo for new version of ORES[4].</li>
</ul>

<p><strong>Documentation &amp; maintenance:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We completed deployment and maintenance docs for Wiki labels[5], which means we&#039;ve now got complete docs for our systems[6].</li>
<li class="remarkup-list-item">We implemented basic continuous integration tests for the ORES extension[7].</li>
</ul>

<p><strong>Downtime:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We had a 1 hour long downtime while trying to deploy new code to ores.wikimedia.org[8].  We&#039;ve filed two critical tasks for making sure we don&#039;t make the mistake again[9,10].</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134606" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_726"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_725" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134606</span></a> - Score multiple models with the same cached dependencies</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T139407" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_728"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_727" aria-hidden="true"></span>https://phabricator.wikimedia.org/T139407</span></a> - Don&#039;t load models into memory of web workers</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T141376" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_730"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_729" aria-hidden="true"></span>https://phabricator.wikimedia.org/T141376</span></a> - Update precached to group requests by model</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T141377" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_732"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_731" aria-hidden="true"></span>https://phabricator.wikimedia.org/T141377</span></a> - Update wmflabs deploy repo for new version of ORES</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T131768" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_734"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_733" aria-hidden="true"></span>https://phabricator.wikimedia.org/T131768</span></a> - Wikilabels deployment docs</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T106271" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_736"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_735" aria-hidden="true"></span>https://phabricator.wikimedia.org/T106271</span></a> - Document maintenance tasks</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T140455" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_738"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_737" aria-hidden="true"></span>https://phabricator.wikimedia.org/T140455</span></a> - CI test for ORES extension</li>
<li class="remarkup-list-item"><a href="https://wikitech.wikimedia.org/wiki/Incident_documentation/20160801-ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://wikitech.wikimedia.org/wiki/Incident_documentation/20160801-ORES</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T141823" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_740"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_739" aria-hidden="true"></span>https://phabricator.wikimedia.org/T141823</span></a> - Set up password on ORES Beta redis server</li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T141825" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_742"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_741" aria-hidden="true"></span>https://phabricator.wikimedia.org/T141825</span></a> - Config beta ORES extension to use the beta ORES service</li>
</ol>

<p>Sincerely,<br />
Aaron from the Revision Scoring team</p>

<p><strong>Edit:</strong> Note that when I copied this post, I forgot to copy the followups from the same month.  See them all here:</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Aug 8th: <a href="https://lists.wikimedia.org/pipermail/ai/2016-August/000052.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-August/000052.html</a></li>
<li class="remarkup-list-item">Aug 16th: <a href="https://lists.wikimedia.org/pipermail/ai/2016-August/000066.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-August/000066.html</a></li>
<li class="remarkup-list-item">Aug 22nd: <a href="https://lists.wikimedia.org/pipermail/ai/2016-August/000070.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-August/000070.html</a></li>
<li class="remarkup-list-item">Aug 29th: <a href="https://lists.wikimedia.org/pipermail/ai/2016-August/000080.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-August/000080.html</a></li>
</ul></div></content></entry><entry><title>Status update (July 6th, 2016)</title><link href="/phame/live/8/post/28/status_update_july_6th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/28/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T16:11:53+00:00</published><updated>2017-06-03T16:11:53+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-July/000039.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-July/000039.html</a>)</p>

<p>Hey,<br />
This is the 11th weekly update from revision scoring team that we have sent<br />
to this mailing list.</p>

<p>*New developments:*</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">ORES review tool as a beta feature is enabled in Dutch Wikipedia. More wikis to come soon this week [1].</li>
<li class="remarkup-list-item">We have basic edit quality model for Czech Wikipedia ready and merged. To be deployed this week [2].</li>
<li class="remarkup-list-item">We also have basic models for English Wiktionary too. This is the second non-Wikipedia project we support after Wikidata [3].</li>
<li class="remarkup-list-item">Thanks to Tar Lócesilion, we have Polish edit quality campaign completed, We are working on building damaging and goodfaith models at the moment [4].</li>
</ul>

<p>*Maintenance and robustness:*</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We decreased our web capacity in order to reduce memory pressure on scb nodes. You should not get any overload error since our capacity is still very high but if you do, please contact us immediately and we will bring it back up [5].</li>
<li class="remarkup-list-item">We improved documentation on ores.wikimedia.org page a little bit. To be deployed this week [6].</li>
</ul>

<p>We are working on a rather big refactor on ores which will give us<br />
performance boost on scoring multiple models at the same time [7] and<br />
reduce memory usage [8]. Feel free to chime in and give us feedback [9].</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T139432" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_744"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_743" aria-hidden="true"></span>https://phabricator.wikimedia.org/T139432</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T138885" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_746"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_745" aria-hidden="true"></span>https://phabricator.wikimedia.org/T138885</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T138630" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_748"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_747" aria-hidden="true"></span>https://phabricator.wikimedia.org/T138630</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130269" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_750"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_749" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130269</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T139177" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_752"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_751" aria-hidden="true"></span>https://phabricator.wikimedia.org/T139177</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T138089" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_754"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_753" aria-hidden="true"></span>https://phabricator.wikimedia.org/T138089</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134606" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_756"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_755" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134606</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T139407" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_758"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_757" aria-hidden="true"></span>https://phabricator.wikimedia.org/T139407</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T139408" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_760"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_759" aria-hidden="true"></span>https://phabricator.wikimedia.org/T139408</span></a></li>
</ol>

<p>Sincerely,<br />
Amir from the Revision Scoring team.</p></div></content></entry><entry><title>ORES going into production (June 22, 2016)</title><link href="/phame/live/8/post/27/ores_going_into_production_june_22_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/27/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T16:10:18+00:00</published><updated>2017-06-03T16:13:31+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-June/000036.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-June/000036.html</a>)</p>

<p>Hey folks,</p>

<p>We (The Revision Scoring Team[1]) are happy to announce the deployment of<br />
the ORES service[2] in production at a new address:<br />
<a href="https://ores.wikimedia.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wikimedia.org</a>.  This will replace the old Wikimedia Labs<br />
address soon: <a href="https://ores.wmflabs.org" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores.wmflabs.org</a>.  Along with this new location, we<br />
are running on more predictable infrastructure that will allow us to<br />
increase our uptime and to make the service available to MediaWiki and<br />
extensions directly.</p>

<p>We&#039;ve also begun deploying the ORES review tool[3] as a beta feature on<br />
Wikidata and Persian Wikipedia in order trial the fully integrated<br />
extension.  Once enabled, the ORES review tool highlights edits that are<br />
likely to be damaging in Special:RecentChanges to help you prioritize your<br />
patrolling work. ORES is an experimental technology.  We encourage you to<br />
take advantage of it but also to be skeptical of the predictions made.<br />
Please reach out to us with your questions and concerns.</p>

<p>We&#039;ll soon begin to deploy the ORES review tool to more wikis.  Next up are<br />
English, Portuguese, Russian, Dutch and Turkish Wikipedias.  We can deploy<br />
to these wikis because those communities have completed Wiki labels[4]<br />
campaigns that help train ORES&#039; classifiers to differentiate good-faith<br />
mistakes from vandalism.  If you&#039;d like to get the ORES review tool<br />
deployed in your wiki, please reach out to us for help setting up or<br />
completing a Wiki labels campaign on your local wiki.  Wikimania<br />
participants can also attend our workshop[5] during the hackathon to get<br />
setting up ORES  for your local wiki.</p>

<p><strong>Documentation:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/ORES_review_tool" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/ORES_review_tool</a></li>
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/Extension:ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/Extension:ORES</a></li>
<li class="remarkup-list-item"><a href="https://meta.wikimedia.org/wiki/Objective_Revision_Evaluation_Service" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Objective_Revision_Evaluation_Service</a></li>
</ul>

<p><strong>Bugs &amp; feature requests:</strong> <a href="https://phabricator.wikimedia.org/tag/revision-scoring-as-a-service-backlog/" class="remarkup-link" rel="noreferrer">https://phabricator.wikimedia.org/tag/revision-scoring-as-a-service-backlog/</a><br />
<strong>IRC:</strong> #wikimedia-ai[6]</p>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service#Team" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service#Team</a></li>
<li class="remarkup-list-item"><a href="https://meta.wikimedia.org/wiki/Objective_Revision_Evaluation_Service" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Objective_Revision_Evaluation_Service</a></li>
<li class="remarkup-list-item"><a href="https://www.mediawiki.org/wiki/ORES_review_tool" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://www.mediawiki.org/wiki/ORES_review_tool</a></li>
<li class="remarkup-list-item"><a href="https://meta.wikimedia.org/wiki/Wiki_labels" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Wiki_labels</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134628" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_762"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_761" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134628</span></a></li>
<li class="remarkup-list-item"><a href="https://webchat.freenode.net/?channels=#wikimedia-ai" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://webchat.freenode.net/?channels=#wikimedia-ai</a></li>
</ol>

<p>Stay tuned for an update about deprecation of ores.wmflabs.org and<br />
announcements of support for new wikis.  Please feel free to reach out to<br />
us with any questions/ideas.</p>

<p>-Aaron</p></div></content></entry><entry><title>Status update (June 14th, 2016)</title><link href="/phame/live/8/post/26/status_update_june_14th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/26/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T16:03:11+00:00</published><updated>2017-06-03T16:04:36+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-June/000033.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-June/000033.html</a>)</p>

<p>Hi folks,</p>

<p>This is the 8th weekly update for the revision scoring team that we have<br />
sent to this mailing list.</p>

<p><strong>New developments:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">ORES extension got deployed in Persian Wikipedia. [1] Give it a try![2]</li>
<li class="remarkup-list-item">Article quality model (&quot;wp10&quot; model) is now working for Russian Wikipedia. [3] It will be deployed this week</li>
<li class="remarkup-list-item">We deplyed article topic campaign for English Wikipedia [4]</li>
<li class="remarkup-list-item">ores.wikimedia.org does have a grafana dashboard now [5]</li>
</ul>

<p><strong>Maintenance and robustness:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">ORES icinga didn&#039;t work when workers were down[13], it got fixed [6]</li>
<li class="remarkup-list-item">We finished load testing ores.wikimedia.org and it was quite alright [7] [8]</li>
<li class="remarkup-list-item">CORS is moved to uwsgi level to it works in prod too [9]</li>
<li class="remarkup-list-item">Deploying new versions of ORES in prod and labs has a proper documentation page now [10] [11]</li>
<li class="remarkup-list-item">We had intermittent spikes of errored revisions, got it resolved [12]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130211" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_764"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_763" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130211</span></a></li>
<li class="remarkup-list-item"><a href="https://fa.wikipedia.org/wiki/Special:Preferences#mw-prefsection-betafeatures" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://fa.wikipedia.org/wiki/Special:Preferences#mw-prefsection-betafeatures</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T131635" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_766"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_765" aria-hidden="true"></span>https://phabricator.wikimedia.org/T131635</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T137325" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_768"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_767" aria-hidden="true"></span>https://phabricator.wikimedia.org/T137325</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T137367" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_770"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_769" aria-hidden="true"></span>https://phabricator.wikimedia.org/T137367</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T137592" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_772"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_771" aria-hidden="true"></span>https://phabricator.wikimedia.org/T137592</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T137365" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_774"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_773" aria-hidden="true"></span>https://phabricator.wikimedia.org/T137365</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T137131" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_776"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_775" aria-hidden="true"></span>https://phabricator.wikimedia.org/T137131</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T137433" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_778"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_777" aria-hidden="true"></span>https://phabricator.wikimedia.org/T137433</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T137570" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_780"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_779" aria-hidden="true"></span>https://phabricator.wikimedia.org/T137570</span></a></li>
<li class="remarkup-list-item"><a href="https://wikitech.wikimedia.org/wiki/Ores/Deployment" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://wikitech.wikimedia.org/wiki/Ores/Deployment</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134109" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_782"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_781" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134109</span></a></li>
<li class="remarkup-list-item"><a href="https://wikitech.wikimedia.org/wiki/Incident_documentation/20160610-ORES" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://wikitech.wikimedia.org/wiki/Incident_documentation/20160610-ORES</a></li>
</ol>

<p>Best,<br />
Amir</p></div></content></entry><entry><title>Status update (June 7th, 2016)</title><link href="/phame/live/8/post/25/status_update_june_7th_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/25/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T15:54:54+00:00</published><updated>2017-06-03T15:54:54+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-June/000032.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-June/000032.html</a>)</p>

<p>Hey folks,</p>

<p>This is the 7th weekly update for the revision scoring team that we have<br />
sent to this mailing list.</p>

<p><strong>New developments:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">The production version of ORES (ores.wikimedia.org) is live! We are still testing and it might break sometimes. [1]</li>
<li class="remarkup-list-item">Norwegian basic support is completed. We deploy this very soon. [2] [3]</li>
<li class="remarkup-list-item">ores-experiment.wmflabs.org is a new setup to run experimental models.  [4]</li>
<li class="remarkup-list-item">We implemented a demo of dependent task in celery, our distributed processing environment [5] which brings us closer to a key performance improvement [6]</li>
</ul>

<p><strong>Maintenance and robustness:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We deprecated flower in favor of grafana ( <a href="https://grafana.wikimedia.org/dashboard/db/ores" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://grafana.wikimedia.org/dashboard/db/ores</a> ) [7]</li>
<li class="remarkup-list-item">Intermittent performance issues with wikilabels are now solved [8]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T106867" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_784"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_783" aria-hidden="true"></span>https://phabricator.wikimedia.org/T106867</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T131856" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_786"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_785" aria-hidden="true"></span>https://phabricator.wikimedia.org/T131856</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T131855" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_788"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_787" aria-hidden="true"></span>https://phabricator.wikimedia.org/T131855</span></a></li>
<li class="remarkup-list-item"><a href="https://ores-experiment.wmflabs.org/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://ores-experiment.wmflabs.org/</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T136875" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_790"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_789" aria-hidden="true"></span>https://phabricator.wikimedia.org/T136875</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134606" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_792"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_791" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134606</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T137003" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_794"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_793" aria-hidden="true"></span>https://phabricator.wikimedia.org/T137003</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130872" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_796"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_795" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130872</span></a></li>
</ol>

<p>Sincerly<br />
-Amir</p></div></content></entry><entry><title>Status update (May 23rd, 2016)</title><link href="/phame/live/8/post/24/status_update_may_23rd_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/24/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T15:52:11+00:00</published><updated>2017-06-03T15:52:11+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-May/000030.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-May/000030.html</a>)</p>

<p>Hey,<br />
This is the fifth weekly update for revision scoring team that we sent to<br />
this mailing list.</p>

<p><strong>New developments:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We got Swedish basic model ready to deloy, likely to happen in the next week [1] [2]</li>
<li class="remarkup-list-item">We generated list of bad words for every Wikipedia with more than 100K articles (with a few exceptions) [3]</li>
</ul>

<p><strong>Maintenance and robustness:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">We enabled CORS for Wikimedia wikis in wikilabels and now we won&#039;t let you do write actions via GET [4]</li>
<li class="remarkup-list-item">We are using systemd watchdogs in precaching to be sure it stays alive. [5]</li>
<li class="remarkup-list-item">We are changing some settings in nginx and uwsgi in order to finalize moving to prod [6]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T131450" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_798"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_797" aria-hidden="true"></span>https://phabricator.wikimedia.org/T131450</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T135604" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_800"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_799" aria-hidden="true"></span>https://phabricator.wikimedia.org/T135604</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134629" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_802"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_801" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134629</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T135377" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_804"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_803" aria-hidden="true"></span>https://phabricator.wikimedia.org/T135377</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T135941" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_806"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_805" aria-hidden="true"></span>https://phabricator.wikimedia.org/T135941</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T135655" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_808"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_807" aria-hidden="true"></span>https://phabricator.wikimedia.org/T135655</span></a></li>
</ol>

<p>Sincerely,<br />
The Revision scoring team</p></div></content></entry><entry><title>Status update (May 10, 2016)</title><link href="/phame/live/8/post/23/status_update_may_10_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/23/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T15:49:47+00:00</published><updated>2017-06-03T15:52:56+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-May/000026.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-May/000026.html</a>)</p>

<p>Hey folks,</p>

<p>This is the weekly update for the Revision Scoring project for the week of<br />
May 2nd through May 8th.</p>

<p><strong>New developments:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Solved some issues that block a major performance improvement for score requests using multiple models[2]</li>
<li class="remarkup-list-item">Improved the performance of feature extraction for features that use mwparserfromhell[3,4]</li>
<li class="remarkup-list-item">We applied regex performance optimizations to badwords and informal word detection for many languages[9]</li>
</ul>

<p><strong>Maintenance and robustness:</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Solved a regression in ScoredRevisions that caused most revisions in RecentChanges to not be scored[1]</li>
<li class="remarkup-list-item">Set ORES load balancer to rebalance on 500 responses from a web node[5]</li>
<li class="remarkup-list-item">Enabled CORS for error responses from ORES -- this makes it easier to report errors from a gadget on a wiki[6]</li>
<li class="remarkup-list-item">Sade the staging instance of Wikilabels[7] look a lot more like the production instance[8]</li>
</ul>



<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134601" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_810"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_809" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134601</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134781" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_812"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_811" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134781</span></a></li>
<li class="remarkup-list-item"><a href="https://mwparserfromhell.readthedocs.io" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://mwparserfromhell.readthedocs.io</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134780" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_814"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_813" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134780</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T111806" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_816"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_815" aria-hidden="true"></span>https://phabricator.wikimedia.org/T111806</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T119325" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_818"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_817" aria-hidden="true"></span>https://phabricator.wikimedia.org/T119325</span></a></li>
<li class="remarkup-list-item"><a href="https://labels-staging.wmflabs.org/gadget/" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://labels-staging.wmflabs.org/gadget/</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134627" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_820"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_819" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134627</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134267" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_822"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_821" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134267</span></a></li>
</ol>

<p>Stay tuned!<br />
--Aaron</p></div></content></entry><entry><title>Status update (May 2nd, 2016)</title><link href="/phame/live/8/post/22/status_update_may_2nd_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/22/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T15:46:15+00:00</published><updated>2017-06-03T15:46:15+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-May/000022.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-May/000022.html</a>)</p>

<p>The second weekly update on the Revision Scoring project.</p>

<p><strong>New developments</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">ORES has graphite dashboard now [1,2]</li>
<li class="remarkup-list-item">Deploying new campaigns and testing Wikilabels got easier [3,4]</li>
<li class="remarkup-list-item">Revscoring feature extraction got about 13% faster [5]</li>
<li class="remarkup-list-item">We deployed new versions of ORES and Wikilabels [6,7]</li>
<li class="remarkup-list-item">Wikidata ScoredRevision gadget had a serious issue, it got fixed [8]</li>
</ul>

<p><strong>Progress in supporting new languages</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Wikidata damaging and goodfaith models are built and deployed [9,10,11]</li>
<li class="remarkup-list-item">Dutch damaging and goodfaith models are built and deployed [12]</li>
<li class="remarkup-list-item">We are working on langauge utilities of Tamil [13]</li>
</ul>

<p><strong>Active Labeling campaigns</strong></p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Edit quality (damaging and good faith)</li>
<li class="remarkup-list-item">Wikipedias: Arabic, Azerbaijani, German, French, Hebrew, Hungarian, Indonesian, Italian, Japanese, Norweigian, Persian (v2), Polish, Spanish, Ukranian, Urdu, Vietnamese</li>
<li class="remarkup-list-item">Edit type</li>
<li class="remarkup-list-item">English Wikipedia</li>
</ul>

<ol class="remarkup-list">
<li class="remarkup-list-item"><a href="https://grafana.wikimedia.org/dashboard/db/ores" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://grafana.wikimedia.org/dashboard/db/ores</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T127594" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_824"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_823" aria-hidden="true"></span>https://phabricator.wikimedia.org/T127594</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T133557" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_826"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_825" aria-hidden="true"></span>https://phabricator.wikimedia.org/T133557</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T102336" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_828"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_827" aria-hidden="true"></span>https://phabricator.wikimedia.org/T102336</span></a></li>
<li class="remarkup-list-item"><a href="https://github.com/wiki-ai/revscoring/pull/268" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://github.com/wiki-ai/revscoring/pull/268</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134032" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_830"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_829" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134032</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134174" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_832"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_831" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134174</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T133903" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_834"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_833" aria-hidden="true"></span>https://phabricator.wikimedia.org/T133903</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130274" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_836"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_835" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130274</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T130301" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_838"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_837" aria-hidden="true"></span>https://phabricator.wikimedia.org/T130301</span></a></li>
<li class="remarkup-list-item"><a href="https://lists.wikimedia.org/pipermail/wikidata/2016-May/008641.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/wikidata/2016-May/008641.html</a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T133563" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_840"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_839" aria-hidden="true"></span>https://phabricator.wikimedia.org/T133563</span></a></li>
<li class="remarkup-list-item"><a href="https://phabricator.wikimedia.org/T134105" class="phui-tag-view phui-tag-type-shade phui-tag-blue phui-tag-shade phui-tag-icon-view " data-sigil="hovercard" data-meta="0_842"><span class="phui-tag-core "><span class="visual-only phui-icon-view phui-font-fa fa-anchor" data-meta="0_841" aria-hidden="true"></span>https://phabricator.wikimedia.org/T134105</span></a></li>
</ol></div></content></entry><entry><title>Status update (April 25, 2016)</title><link href="/phame/live/8/post/20/status_update_april_25_2016/" /><id>https://phabricator.wikimedia.org/phame/post/view/20/</id><author><name>Halfak (Aaron Halfaker, EpochFail, halfak)</name></author><published>2017-06-03T15:40:53+00:00</published><updated>2017-06-03T15:49:59+00:00</updated><content type="xhtml"><div xmlns="http://www.w3.org/1999/xhtml"><p>(This post was copied from <a href="https://lists.wikimedia.org/pipermail/ai/2016-April/000019.html" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://lists.wikimedia.org/pipermail/ai/2016-April/000019.html</a>)</p>

<p>Hello, This is our first weekly update being posted in this mailing list</p>

<p>New Developments</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Now you can abandon tasks you don&#039;t want to review in Wikilabels (<a href="https://phabricator.wikimedia.org/T105521" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_843"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T105521</span></span></a>)</li>
<li class="remarkup-list-item">We collect user-agents in ORES requests (<a href="https://phabricator.wikimedia.org/T113754" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_844"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T113754</span></span></a>)</li>
<li class="remarkup-list-item">Precaching in ORES will be a daemon and more selective (<a href="https://phabricator.wikimedia.org/T106638" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_845"><span class="phui-tag-core-closed"><span class="phui-tag-core phui-tag-color-object">T106638</span></span></a>)</li>
</ul>

<p>Progress in supporting new languages</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Russian reverted, damaging, and goodfaith models are built. They look good and will be deployed this week.</li>
<li class="remarkup-list-item">Hungarian reverted model is built, will be deployed this week. Campaign for goodfaith and damaging is loaded in Wikilabels.</li>
<li class="remarkup-list-item">Japanese reverted model are built, but there are still some issues to work out. (<a href="https://phabricator.wikimedia.org/T133405" class="phui-tag-view phui-tag-type-object " data-sigil="hovercard" data-meta="0_846"><span class="phui-tag-core phui-tag-color-object">T133405</span></a>)</li>
</ul>

<p>Active Labeling campaigns</p>

<ul class="remarkup-list">
<li class="remarkup-list-item">Edit quality (damaging and good faith)</li>
<li class="remarkup-list-item">Wikipedias: Arabic, Azerbaijani, Dutch, German, French, Hebrew, Hungarian, Indonesian, Italian, Japanese, Norwegian, Persian (v2), Polish, Spanish, Ukrainian, Urdu, Vietnamese<ul class="remarkup-list">
<li class="remarkup-list-item">Wikidata</li>
</ul></li>
<li class="remarkup-list-item">Edit type</li>
<li class="remarkup-list-item">English Wikipedia</li>
</ul>

<p>Sincerely,<br />
The Revision Scoring team.<br />
<a href="https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service#Team" class="remarkup-link remarkup-link-ext" rel="noreferrer">https://meta.wikimedia.org/wiki/Research:Revision_scoring_as_a_service#Team</a></p></div></content></entry></feed>