✩ Status update (September 28th, 2016)

Status update (September 28th, 2016)

September 28th, 2016

Hey,

This is the 23rd weekly update from revision scoring team that we have sent
to this mailing list.

New development

We implemented and demonstrated a linguistic/stylometric processing strategy that should give us more signal for finding vandalism and spam[1]. See the discussion on the AI list[2].
As part of our support for the Collaboration Team, we've been producing tables of model statistics that correspond to set of thresholds[3]. This helps their designers work on strategies for reporting prediction confidence in an intuitive way.

Maintenance and robustness

We had a major downtime event that was caused by our logs being too verbose. We've recovered and turned down the log level[4].
We made sure that halfak got pings when ores.wikimedia.org goes down[5]

Datasets

We created a database on Wikimedia Labs that provides access to a dataset containing a complete set of article quality predictions for English Wikipedia[6]. See our announcements[7,8,9].

Sincerely,
Aaron from the Revision Scoring team

Written by Halfak on Jun 3 2017, 5:12 PM.

Principal Research Scientist

Projects

Subscribers

None