In most cases, articles created with Content Translation have lower deletion ratios than new articles created from scratch (during 2019: 5% vs. 11% across all languages). For certain languages, however, the story can be different.
In some cases, articles created with Content Translation may be less likely to survive than those created from scratch. For example, for Indonesian (T219851#5914691) and Telugu (T244769) the deletion ratios for Content Translation were higher than for other articles created in these wikis. These cases can be addressed by adjusting the translation limits, but we have no systematic way to identify them: we only learn about them when editors report them.
This ticket proposes to generate a list of Wikipedias showing their deletion ratios for new articles created with and without Content Translation. The list will surface the wikis where the deletion ratio for Content Translation is higher than usual. The measurement should cover a period long enough to give editors time to review content and to avoid seasonal effects.
This can be supported by a query with the following information:
Language | New CX articles | New non-CX articles | Deleted CX articles | Deleted non-CX articles | Deleted CX % | Deleted non-CX % | Deletion % difference (scratch - CX) |
---|---|---|---|---|---|---|---|
TE | ... | ... | ... | ... | 22% | 15% | -7% |
(Results could be ordered by the "Deletion % difference" column to identify the cases with the largest gaps.)
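As a rough illustration of how the derived columns in the table could be computed once the four counts per wiki are available, here is a minimal Python sketch. It is not the query used for the report: the `WikiCounts` type, the `build_report` function, and the sample counts for the placeholder wikis `xx` and `yy` are all hypothetical. The difference is taken as non-CX % minus CX %, matching the "(scratch - CX)" column, so a negative value means Content Translation articles were deleted more often.

```python
from dataclasses import dataclass


@dataclass
class WikiCounts:
    """Per-wiki article counts (hypothetical structure for illustration)."""
    language: str
    new_cx: int
    new_non_cx: int
    deleted_cx: int
    deleted_non_cx: int


def deletion_pct(deleted, created):
    """Return the deletion percentage, or None when nothing was created."""
    if created == 0:
        return None
    return 100.0 * deleted / created


def build_report(rows):
    """Compute deletion percentages and their difference for each wiki.

    Wikis with no CX articles or no non-CX articles are skipped because
    the comparison is undefined for them. Rows are sorted ascending by
    difference, so wikis where CX articles fare worst come first.
    """
    report = []
    for r in rows:
        cx_pct = deletion_pct(r.deleted_cx, r.new_cx)
        non_cx_pct = deletion_pct(r.deleted_non_cx, r.new_non_cx)
        if cx_pct is None or non_cx_pct is None:
            continue
        report.append({
            "language": r.language,
            "deleted_cx_pct": cx_pct,
            "deleted_non_cx_pct": non_cx_pct,
            # scratch - CX: negative means CX is deleted more often
            "difference": non_cx_pct - cx_pct,
        })
    report.sort(key=lambda row: row["difference"])
    return report


# Made-up counts for two placeholder wikis, for demonstration only.
sample = [
    WikiCounts("yy", new_cx=100, new_non_cx=1000, deleted_cx=5, deleted_non_cx=110),
    WikiCounts("xx", new_cx=200, new_non_cx=1000, deleted_cx=44, deleted_non_cx=150),
]

for row in build_report(sample):
    print(
        f'{row["language"]}: CX {row["deleted_cx_pct"]:.0f}%, '
        f'non-CX {row["deleted_non_cx_pct"]:.0f}%, '
        f'difference {row["difference"]:+.0f}%'
    )
```

Sorting in ascending order puts the wikis that most need attention at the top. Wikis with very few Content Translation articles can produce noisy percentages, so a minimum-count threshold may be worth adding in practice.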
Individual queries were created to generate this report.