Page MenuHomePhabricator

Improve MT support for Tsonga with OpusMT
Closed, ResolvedPublic

Description

Now that OpusMT is integrated in Content Translation (T234194), we are looking for languages with no previous Machine Translation (MT) support that could benefit from it. OpusMT is an opensource neural machine translation system that is trained with freely licensed multilingual contents (including articles created with Content Translation).

Enabling OpusMT is an experimental process since the initial quality is expected to be low, but since it improves as more translations are created, it is also a way for the community to improve the MT support for their language by translating Wikipedia articles using Content Translation. So it may be an interesting process for communities to engage in.

In the initial iteration we enabled OpusMT for Assamese, and Tsonga seems a good candidate to consider next since it is an active community with no current MT support. In the same way as we did with Assamese, we want to involve the Tsonga community. We only plan enable OpusMT is there are no concerns from the community. Based on the observations of how the process goes, we'll be open to adjust (e.g., make more strict the limits for publishing) or disable the system as needed.

Event Timeline

Pginer-WMF renamed this task from Improve MT support for Breton with OpusMT to Improve MT support for Tsonga with OpusMT.Dec 10 2020, 2:28 PM
Pginer-WMF updated the task description. (Show Details)
Pginer-WMF added a subscriber: KartikMistry.

@KartikMistry can you add English-Tsonga model to our OpusMT instance so that community members can give it a try?

@KartikMistry can you add English-Tsonga model to our OpusMT instance so that community members can give it a try?

Done.

Pginer-WMF raised the priority of this task from Medium to High.

Change 650084 had a related patch set uploaded (by KartikMistry; owner: KartikMistry):
[mediawiki/services/cxserver@master] config: Enable en->ts MT pair for OpusMT

https://gerrit.wikimedia.org/r/650084

Change 650084 merged by jenkins-bot:
[mediawiki/services/cxserver@master] config: Enable en->ts MT pair for OpusMT

https://gerrit.wikimedia.org/r/650084

Change 650103 had a related patch set uploaded (by KartikMistry; owner: KartikMistry):
[operations/deployment-charts@master] Update cxserver to 2020-12-17-111820-production

https://gerrit.wikimedia.org/r/650103

Change 650103 merged by jenkins-bot:
[operations/deployment-charts@master] Update cxserver to 2020-12-17-111820-production

https://gerrit.wikimedia.org/r/650103

Mentioned in SAL (#wikimedia-operations) [2020-12-17T11:38:41Z] <kart_> Updated cxserver to 2020-12-17-111820-production (T262192)