Page MenuHomePhabricator

Determine language support for Tone Check (v1)
Closed, ResolvedPublic

Description

This task involves the work for determining what languages the first version of Peacock Check will support. This will in turn affect the wikis we approach to be partners for this work (T387921).

Selection criteria

We are seeking languages/wikis where:

  1. The model – as currently conceived – will perform with high enough precision for volunteers seeing Peacock Check(s) to consider them reliable/useful
  2. Newcomers and junior contributors publish new content containing peacock language at a high enough rate for experienced volunteers to perceive this as an important issue to address
  3. Training data is accessible enough for evaluating the model to be relatively straightforward

Selection process

  1. Verify cost of gathering training/evaluation data for languages previous BERT model considered (see "3." above),
  2. Evaluate model performance on languages for which training/evaluation data is relatively low-cost/effort, and see which languages are probably launch-ready vs. which languages would require us to update the model
  3. For each of the languages that would require an update to the model, get a sense for how often peacock edits get reverted. This will help us prioritize whether to update the model, or just launch with the launch-ready languages

Languages to evaluate

Peacock Check/Language Selection (v1)

Related Objects

Event Timeline

Aklapper renamed this task from Determine language support for Peacock Check (v1) to Determine language support for Tone Check (v1).May 28 2025, 11:43 AM
ppelberg claimed this task.

Initially, Tone Check will support the following five languages: English, French, Japanese, Spanish, and Portuguese.

Please see T388471#10781906 for details about how and these languages were prioritized to start. Note: the goal remains for Tone Check to support all languages.

The above was published on-wiki here: https://www.mediawiki.org/w/index.php?diff=7737825