In T388471, we will be inviting volunteers to review the Peacock Check language model using Annotool.
This task represents the work of updating Annotool to enable the above.
Requirements
- Annotool is updated so that each edit/diff includes a free-text field wherein volunteers can offer context about what language they consider to be non-neutral within a given edit/diff
- Annotool instances are created for each of the languages listed in the === Language instances section below
- Note: the Machine Learning Team will be updating T388471 as Eval Data is ready for inclusion in Annotool.
- Each Annotool instance will include 100 samples for languages with 3-5 evaluators, and 150 samples for languages with 6+ evaluators.
Language instances
| Wiki | Language | Status | Link | Notes |
|---|---|---|---|---|
| ar.wiki | Arabic | |||
| cs.wiki | Czech | |||
| de.wiki | German | |||
| en.wiki | English | https://annotool.toolforge.org/projects/13 | ||
| es.wiki | Spanish | https://annotool.toolforge.org/projects/14 | ||
| fa.wiki | Persian | |||
| fr.wiki | French | https://annotool.toolforge.org/projects/16 | ||
| he.wiki | Hebrew | |||
| id.wiki | Indonesian | |||
| it.wiki | Italian | |||
| ja.wiki | Japanese | https://annotool.toolforge.org/projects/15 | ||
| nl.wiki | Dutch | |||
| no.wiki | Norwegian Bokmål | |||
| pl.wiki | Polish | |||
| pt.wiki | Portuguese | https://annotool.toolforge.org/projects/17 | ||
| ro.wiki | Romanian | |||
| ru.wiki | Russian | |||
| tr.wiki | Turkish | |||
| uk.wiki | Ukrainian | |||
| zh.wiki | Chinese | |||
Evaluator experience
- An evaluator who signs up in T388471 will receive a link to the Annotool instance that pertains to the language they will be evaluating.
- The evaluator reads the labelling instructions included in the Annotool instance.
- The evaluator receives samples in a random order, and they are asked to review at least 30 samples. Evaluators are encouraged to review all of the samples in the dataset if they are able to, as this increases the chances that each sample will have multiple evaluators.
- Once the evaluator has completed their labelling, they notify us in T388471.