Page MenuHomePhabricator

Extend evaluation data to include Chinese Wikipedia
Closed, ResolvedPublic

Description

The existing evaluation for the model does so across five languages: Arabic, English, French, Hungarian, and Turkish. Chinese Wikipedia (zhwiki) now also uses the PageAssessments extension from which we retrieve the groundtruth data, so we can add it to that list!

Adding it will require:

Resources:

NOTE: not all of the existing quality ratings for zhwiki will be translate-able into the English quality classes we're using (stub/start/c/b/ga/fa) but we should hopefully be able to get examples for each of these classes and not throw away too much data.