Page MenuHomePhabricator

Request to Add Machine Translation Support for Karakalpak Language in Content Translation
Open, Needs TriagePublicFeature

Description

Dear Wikimedia Language Engineering Team,

I am an active contributor to the Karakalpak Wikipedia and would like to request that machine translation support for the Karakalpak language be added to the Content Translation tool.

Currently, when I attempt to translate articles from Russian or other languages into Karakalpak, no machine translation is provided, and the original source text remains unchanged. This significantly slows down the translation process and discourages new contributors from participating.

I understand that machine translation is typically supported via services such as Apertium or Google Translate. While Karakalpak is not yet supported directly by these systems, I believe that its linguistic similarity to Uzbek and Kazakh, as well as the growing number of Karakalpak-language resources online, can help build a basic system or allow adaptation via existing engines.

I am ready to assist in testing or helping improve Karakalpak translations if support is enabled.

Thank you for considering this request. Support for Karakalpak will greatly benefit our small but passionate community of editors and help preserve and promote our language.

Best regards,
Srajatdin Usnatdinov
Active Karakalpak Wikipedia Contributor

Event Timeline

jhsoby lowered the priority of this task from Unbreak Now! to Needs Triage.Apr 29 2025, 9:12 AM
jhsoby edited projects, added ContentTranslation, LPL Essential; removed translatewiki.net.

(Please leave priority changes to the teams in charge of working on tasks.)

We do have a machine translation system for kaa - Karakalpak powered by MADLAD-400 model.

Can be tried https://huggingface.co/spaces/santhosh/madlad400-3b-ct2 or https://translate.wmcloud.org/. We have not evaluated its output quality with native speakers yet.

Are there any alternative opensource models you are aware of?