Flores supports a set of new languages, as well as new language pairs for some previously supported languages. We want to enable these in the test instance of Content Translation in order to check how well they work.
The languages supported are the following:
- bak/ba (Bashkir)
- hau/ha (Hausa)
- ast (Asturian)
- tso/ts (Tsonga)
- asm/as (Assamese)
- ckb (Sorani / Central Kurdish)
- ilo (Iloko)
- kon/kg (Kongo)
- lin/ln (Lingala)
- nso (Northern Sotho)
- ssw/ss (Swati)
- tsn/tn (Tswana)
- yue/zh-yue (Cantonese)
- orm/om (Oromo)
- tir/ti (Tigrinya)
- wol/wo (Wolof)
- ayr (Central Aymara, for Aymara Wikipedia: ay)
- isl/is (Icelandic) (new pairs)
- zho_Hans/zh (Chinese) (new pairs)
- ibo/ig (Igbo) (new pairs)
- zul/zu (Zulu) (new pairs)
- lug/lg (Luganda) (new pairs)
- oci/oc (Occitan) (new pairs)
The specific language pairs to enable are:
- en → {asm, ast, ayr, bak, ckb, hau, ilo, kon, lin, nso, orm, ssw, tsn, tso, yue, wol}
- fr → {asm, ast, ayr, bak, ckb, hau, ibo, ilo, isl, kon, lin, lug, nso, oci, orm, ssw, tir, tsn, tso, wol, yue, zho_Hans, zul}
- es → {asm, ast, ayr, bak, ckb, hau, ibo, ilo, isl, kon, lin, lug, nso, oci, orm, ssw, tir, tsn, tso, yue, zho_Hans, zul}
- {cat, por} → oci
- zho_Hans → yue
- rus → bak
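The grouped notation above can be expanded into explicit source/target pairs, which may be handy when checking that every pair was actually enabled. This is a minimal sketch: the names `PAIR_GROUPS` and `expand_pairs` are hypothetical and do not reflect the real Content Translation configuration format.

```python
# Grouped pairs copied from the list above: (sources, targets).
PAIR_GROUPS = [
    (["en"], ["asm", "ast", "ayr", "bak", "ckb", "hau", "ilo", "kon", "lin",
              "nso", "orm", "ssw", "tsn", "tso", "yue", "wol"]),
    (["fr"], ["asm", "ast", "ayr", "bak", "ckb", "hau", "ibo", "ilo", "isl",
              "kon", "lin", "lug", "nso", "oci", "orm", "ssw", "tir", "tsn",
              "tso", "wol", "yue", "zho_Hans", "zul"]),
    (["es"], ["asm", "ast", "ayr", "bak", "ckb", "hau", "ibo", "ilo", "isl",
              "kon", "lin", "lug", "nso", "oci", "orm", "ssw", "tir", "tsn",
              "tso", "yue", "zho_Hans", "zul"]),
    (["cat", "por"], ["oci"]),
    (["zho_Hans"], ["yue"]),
    (["rus"], ["bak"]),
]


def expand_pairs(groups):
    """Return every (source, target) pair implied by the grouped notation."""
    return [
        (source, target)
        for sources, targets in groups
        for source in sources
        for target in targets
    ]


pairs = expand_pairs(PAIR_GROUPS)
print(len(pairs))
```

With the lists as written, the expansion yields 65 pairs. Note that the lists are not symmetric: for example, `tir` is a target for fr and es but not for en, and `wol` is a target for en and fr but not for es.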
Note that:
- The languages listed above show both the 3-letter code and the Wikipedia code (usually a 2-letter ISO code) when the corresponding Wikipedia does not use the 3-letter one. The language pairs are listed using the 3-letter codes. We still need to determine how to represent them in the configuration to avoid a mismatch between the codes used by the translation service and those used by Wikipedia.
- For Bashkir and Hausa we want to keep the current default services, so Flores will not be the default for those languages initially.
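One possible way to handle the two notes above is to keep an explicit mapping from the codes used in the pair lists to the Wikipedia codes, plus a small set of languages for which Flores should not become the default. This is only a sketch under stated assumptions: `FLORES_TO_WIKI`, `KEEP_CURRENT_DEFAULT`, `wiki_code` and `flores_is_default` are hypothetical names, not part of the actual configuration. The entries for `cat`, `por` and `rus` are not given in this task and are filled in from the codes those Wikipedias use.

```python
# Mapping from the codes used in the pair lists to Wikipedia language codes.
# Codes that are identical on both sides (ast, ckb, ilo, nso) are omitted.
FLORES_TO_WIKI = {
    "bak": "ba", "hau": "ha", "tso": "ts", "asm": "as", "kon": "kg",
    "lin": "ln", "ssw": "ss", "tsn": "tn", "yue": "zh-yue", "orm": "om",
    "tir": "ti", "wol": "wo", "ayr": "ay", "isl": "is", "zho_Hans": "zh",
    "ibo": "ig", "zul": "zu", "lug": "lg", "oci": "oc",
    # Assumption: not listed in the task, added for the source languages.
    "cat": "ca", "por": "pt", "rus": "ru",
}

# Languages that keep their current default service (Bashkir and Hausa).
KEEP_CURRENT_DEFAULT = {"bak", "hau"}


def wiki_code(code):
    """Translate a service-side code to the Wikipedia code, if they differ."""
    return FLORES_TO_WIKI.get(code, code)


def flores_is_default(target):
    """Flores is the default unless the target keeps its current service."""
    return target not in KEEP_CURRENT_DEFAULT


for source, target in [("rus", "bak"), ("zho_Hans", "yue"), ("en", "ast")]:
    print(wiki_code(source), wiki_code(target), flores_is_default(target))
```

Keeping the mapping in one place means the pair lists can stay in 3-letter form while the user-facing configuration uses Wikipedia codes, so a mismatch shows up as a missing dictionary entry rather than as a silently unavailable pair.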