Status update (October 24th, 2016)
October 24th, 2016

(This post was copied from https://lists.wikimedia.org/pipermail/ai/2016-October/000111.html)

Hey,

This is the combined 26th and 27th weekly update from the Revision Scoring
team to this mailing list. We forgot to send the update for last week!

Last week, we were featured in Research's quarterly review. In the last 3
months, we achieved our goals to expand the ORES extension to 6 wikis (we
made it to 8!) and to release datasets of article quality predictions. The
minutes from the quarterly review are not yet online, but once they are,
you'll be able to see them at [1].

Maintenance and robustness:

  • We discussed and decided on a set of strategies for handling good-faith/naive DoS attacks on ORES[2]
  • We fixed an i18n issue in Wiki Labels[3]
  • We updated the article quality models (wikiclass/wp10) to use revscoring 1.3.0[4]
  • We investigated and solved a memory leak in our pre-caching utility[5]
  • We configured celery to send its logs to a place where we can read them for easier debugging[6]
  • We deployed a set of schema changes to constrain the ORES Review Tools database appropriately[7]
  • Also worth noting: the services cluster (SCB) has been expanded[8]. ORES has now doubled in capacity.

Datasets:

  • We discussed how to make the historical article quality dataset available via Quarry[9]. Unfortunately, it seems that we won't be able to do that for at least a couple of months.

New development:

  • We've implemented embedding of machine-readable scores in a JS variable on-wiki[10]. This will make it easier for tool developers to experiment with new ways of displaying Special:RecentChanges. It's also a necessary precondition for adding color-based signaling of ORES' confidence about an edit.

  1. https://meta.wikimedia.org/wiki/Wikimedia_Foundation_metrics_and_activities_meetings/Quarterly_reviews/Research,_Design_Research,_Analytics,_and_Performance,_October_2016
  2. https://phabricator.wikimedia.org/T148347 -- [Discuss] DOS attacks on ORES. What to do?
  3. https://phabricator.wikimedia.org/T139587 -- Revision not found error unformatted and not localized
  4. https://phabricator.wikimedia.org/T147201 -- Update wikiclass for revscoring 1.3.0
  5. https://phabricator.wikimedia.org/T146500 -- Investigate memory leak in precached
  6. https://phabricator.wikimedia.org/T147898 -- Send celery logs to /srv/log/ores instead of /var/lib/daemon.log
  7. https://phabricator.wikimedia.org/T147734 -- Review and deploy 309825
  8. https://phabricator.wikimedia.org/T147903 -- Expand SCB cluster
  9. https://phabricator.wikimedia.org/T146718 -- [Discuss] Hosting the monthly article quality dataset on labsDB
  10. https://phabricator.wikimedia.org/T143611 -- Embed machine readable ores scores as data on pages where ORES scores things

Sincerely,
Aaron from the Revision Scoring team

Written by Halfak on Jun 3 2017, 5:16 PM.
Principal Research Scientist