Page MenuHomePhabricator

Adjust Google Configuration to expose Cantonese MT instead of Chinese
Closed, ResolvedPublic2 Estimated Story Points

Description

Initially, Google Translate did not have support for Cantonese. In order to help translators with what was available, the support for Chinese (with Traditional script) from Google was exposed when users translate into Cantonese (T258919). However, feedback from the community suggested it was not actually useful and MT was disabled from Cantonese (T333835).

Now, Google supports Cantonese and we enabled it in Content Translation based on input form the community (T333835#10400500). However, it seems that the Chinese (with Traditional script) is what is still exposed in content Translation. I include some examples below to illustrate this:

Content TranslationGoogle Translate translation to Cantonese (yue)Google Translate translation to Chinese with traditional script (zh-tw)
zh-yue.m.wikipedia.org_w_index.php_title=Special_ContentTranslation&active-list=suggestions&from=en&to=yue&page=Chocolate(Wiki Mobile).png (568×320 px, 46 KB)
translate.google.com__sl=en&tl=yue&text=Chocolate&op=translate(Wiki Mobile).png (568×320 px, 36 KB)
translate.google.com__sl=en&tl=yue&text=Chocolate&op=translate(Wiki Mobile) (1).png (568×320 px, 37 KB)

The expected result would be for content Translation to use the Cantonese translation (朱古力) instead of the Traditional Chinese one (巧克力) when translating into Cantonese.

Event Timeline

Marking it as high priority since it can generate confusion for the Cantonese community. Due to this bug, we are serving what they indicated that was not useful (Chinese) and not providing them what was identified as useful (the new Cantonese support from Google).

KartikMistry changed the task status from Open to In Progress.Feb 5 2025, 1:50 PM

Google seems to be listing both yue and zh-TW in supported languages for MT and we're mapping yue with zh-TW in the cxserver/lib/mt/Google.js

Google seems to be listing both yue and zh-TW in supported languages for MT and we're mapping yue with zh-TW in the cxserver/lib/mt/Google.js

Google has added support for yue recently, that is the one we should be using.

Google seems to be listing both yue and zh-TW in supported languages for MT and we're mapping yue with zh-TW in the cxserver/lib/mt/Google.js

Google has added support for yue recently, that is the one we should be using.

Right.

Change #1117914 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[mediawiki/services/cxserver@master] Google: Remove direct to zh-TW for yue

https://gerrit.wikimedia.org/r/1117914

Change #1117914 merged by jenkins-bot:

[mediawiki/services/cxserver@master] Google: Remove direct to zh-TW for yue

https://gerrit.wikimedia.org/r/1117914

Change #1118264 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] Update cxserver to 2025-02-10-050623-production

https://gerrit.wikimedia.org/r/1118264

Change #1118264 merged by jenkins-bot:

[operations/deployment-charts@master] Update cxserver to 2025-02-10-050623-production

https://gerrit.wikimedia.org/r/1118264

@Pginer-WMF We have deployed this change. What's best way to test and inform the community?

@Pginer-WMF We have deployed this change. What's best way to test and inform the community?

Thanks Kartik. I informed of the update in the related tickets (T383519#10556415, T333835#10556426). I have also checked that the issue from the ticket description is solved. As captured below, the translations shown in Content Translation and Google Translate are the same for Cantonese (朱古力) :

Screenshot 2025-02-17 at 11.05.00.png (1×2 px, 731 KB)