Across all languages, Wikipedia articles created with Content Translation are deleted less often than those created from scratch. However, that is not always the case for all languages.
We started to collect data about deletions (T286636) which provides an overview about the Wikipedias where translations are deleted more often than other articles. This can be a good indicator of issues with the tool on those particular languages to research further and/or adjust the current quality limits for those wikis to encourage users to review contents further before publishing.
This ticket proposes to define a criteria based on the above data on the kind of adjustments to do. This could look as something like this:
Make the limits for a wiki 10% more strict when the wiki:
- Appears in the list of wikis with high deletion more than once in the past 4 quarters.
- Deletion rate difference is more than 5%
- CX Deletion rate is over 10%
- Number of CX articles is over 50 for a quarter.
The above is just an example to try to focus on the cases where the issues may be happening more consistently and not due to data noise (e.g., small number of articles resulting in a high deletion percentage) or exceptional events (e.g. one user vandalizing)
Task completed
The criteria for adjusting Machine Translation limits in Content Translation based on data provided is documented here.