The [NLLB-200 model](https://ai.facebook.com/research/no-language-left-behind/) supports many languages. However, [the initial integration in Content Translation](https://www.mediawiki.org/wiki/Content_translation/Machine_Translation/NLLB-200) supports [a smaller set of 23 languages](https://phabricator.wikimedia.org/diffusion/GCXS/browse/master/config/Flores.yaml). In order to improve machine translation support, we want to support languages lack currently machine translation support by [the current services available](https://www.mediawiki.org/wiki/Help:Content_translation/Translating/Initial_machine_translation#Machine_translation_availability) but could be provided using NLLB-200.
**Candidate languages identified so far**
These languages have no current MT support enabled, but NLLB-200 could provide it :
# Kashmiri (ks/kas_Arab) ( view [request](https://www.mediawiki.org/wiki/Topic:Wx8zsebwlellf4el) and T326541 )
- Santali (sat/sat_Olck) ( [view request](https://www.mediawiki.org/wiki/Topic:Wz6l388gvokhurgt) )
- Egyptian Arabic (arz/arz_Arab)
- Moroccan Arabic (ary/ary_Arab)
- Kabyle (kab/kab_Latn)
- Banjar (bjn/bjn_Arab/bjn_Latn)
- South Azerbaijani (azb/azb_Arab)
- Lombard (lmo/lmo_Latn)
- Crimean Tatar (crh/crh_Latn)
- Balinese (ban/ban_Latn)
- Faroese (fo/fao_Latn)
- Ligurian (lij/lij_Latn)
- Acehnese (ace/ace_Arab/ace_Latn)
- Silesian (szl/szl_Latn)
- Venetian (vec/vec_Latn)
- Tibetan (bo/bod_Tibt)
- Fulah/Nigerian Fulfulde (ff/fuv_Latn)
- Waray (war/war_Latn)
- Sicilian (scn/scn_Latn)
- Friulian (fur/fur_Latn)
- Limburgish (li/lim_Latn)
- Pangasinan (pag/pag_Latn)
- Buginese (bug/bug_Latn)
- Tok Pisin (tpi/tpi_Latn)
- Fijian (fj/fij_Latn)
- Southwestern Dinka (din/dik_Latn) (pending to check if this particular variant is appropriate for Dinka Wikipedia)
- Rundi (rn/run_Latn)
- Shan (shn/shn_Mymr)
- Kabiyè (kbp/kbp_Latn)
- Latgalian (ltg/ltg_Latn)
- Dzongkha (dz/dzo_Tibt)
- Kikuyu (ki/kik_Latn)
- Sango (sg/sag_Latn)
- Awadhi (awa/awa_Deva)
- Tumbuka (umb/tum_Latn)
- Fon (fon/fon_Latn) (as it graduates from incubator)
- ~~Akan (ak/aka_Latn)~~ (Akan may be supported using Twi)
- ~~Kanuri/Central Kanuri (kr/knc_Arab/knc_Latn)~~ (Wikipedia was closed, back to incubator)
**Languages already supported by NLLB-200 as their only option**
From the list of languages [currently supported by NLLB-200](https://phabricator.wikimedia.org/diffusion/GCXS/browse/master/config/Flores.yaml), these are those not supported by other services:
1. Asturian (ast/ast_Latn)
- Kongo/Kikongo (kg/kon_Latn)
- Northern Sotho (nso/nso_Latn)
- Occitan (oc/oci_Latn)
- Swati (ss/ssw_Latn)
- Tswana (tn/tsn_Latn)
- Wolof (wo/wol_Latn)
- Cantonese/Yue Chinese (zh-yue/yue_Hant)
**Languages listed on the NLLB-200 documentation**
These languages are [supported by the NLLB-200 model](https://github.com/facebookresearch/flores/blob/main/flores200/README.md).
Marking **in bold** those wthout MT, and ~~striked~~ those with MT existing MT support.
- **Acehnese (Arabic script) (ace_Arab)**
- **Acehnese (Latin script) (ace_Latn)**
- //Mesopotamian Arabic (acm_Arab)// !!no wiki yet!!
- //Ta’izzi-Adeni Arabic (acq_Arab)// !!no wiki yet!!
- //Tunisian Arabic (aeb_Arab)// !!no wiki yet!!
- ~~Afrikaans (afr_Latn)~~
- //South Levantine Arabic (ajp_Arab)// !!no wiki yet!!
- **Akan (aka_Latn)**
- ~~Amharic (amh_Ethi)~~
- //North Levantine Arabic (apc_Arab)// !!no wiki yet!!
- ~~Modern Standard Arabic (arb_Arab)~~
- ~~Modern Standard Arabic (Romanized) (arb_Latn)~~
- //Najdi Arabic (ars_Arab)// !!no wiki yet!!
- **Moroccan Arabic (ary_Arab)**
- **Egyptian Arabic (arz_Arab)**
- ~~Assamese (asm_Beng)~~
- ~~Asturian (ast_Latn)~~
- **Awadhi (awa_Deva)**
- ~~Central Aymara (ayr_Latn)~~
- **South Azerbaijani (azb_Arab)**
- //North Azerbaijani (azj_Latn)// !!no wiki yet!!
- ~~Bashkir (bak_Cyrl)~~
- ~~Bambara (bam_Latn)~~
- **Balinese (ban_Latn)**
- ~~Belarusian (bel_Cyrl)~~
- //Bemba (bem_Latn)// !!no wiki yet!!
- ~~Bengali (ben_Beng)~~
- ~~Bhojpuri (bho_Deva)~~
- **Banjar (Arabic script) (bjn_Arab)**
- **Banjar (Latin script) (bjn_Latn)**
- **Standard Tibetan (bod_Tibt)**
- ~~Bosnian (bos_Latn)~~
- **Buginese (bug_Latn)**
- ~~Bulgarian (bul_Cyrl)~~
- ~~Catalan (cat_Latn)~~
- ~~Cebuano (ceb_Latn)~~
- ~~Czech (ces_Latn)~~
- //Chokwe (cjk_Latn)// !!no wiki yet!!
- ~~Central Kurdish (ckb_Arab)~~
- **Crimean Tatar (crh_Latn)**
- ~~Welsh (cym_Latn)~~
- ~~Danish (dan_Latn)~~
- ~~German~~ (deu_Latn)
- **Southwestern Dinka (dik_Latn)**
- //Dyula (dyu_Latn)// !!no wiki yet!!
- **Dzongkha (dzo_Tibt)**
- ~~Greek~~ (ell_Grek)
- ~~English~~ (eng_Latn)
- ~~Esperanto~~ (epo_Latn)
- ~~Estonian~~ (est_Latn)
- ~~Basque~~ (eus_Latn)
- ~~Ewe (ewe_Latn)~~
- **Faroese (fao_Latn)**
- **Fijian (fij_Latn)**
- ~~Finnish (fin_Latn)~~
- **//Fon (fon_Latn)//** !!wiki still in incubator!!
- ~~French (fra_Latn)~~
- **Friulian (fur_Latn)**
- **Nigerian Fulfulde (fuv_Latn)**
- ~~Scottish Gaelic (gla_Latn)~~
- ~~Irish (gle_Latn)~~
- ~~Galician (glg_Latn)~~
- ~~Guarani (grn_Latn)~~
- ~~Gujarati (guj_Gujr)~~
- ~~Haitian Creole (hat_Latn)~~
- ~~Hausa (hau_Latn)~~
- ~~Hebrew (heb_Hebr)~~
- ~~Hindi (hin_Deva)~~
- //Chhattisgarhi (hne_Deva)// !!no wiki yet!!
- ~~Croatian (hrv_Latn)~~
- ~~Hungarian (hun_Latn)~~
- ~~Armenian (hye_Armn)~~
- ~~Igbo (ibo_Latn)~~
- ~~Ilocano (ilo_Latn)~~
- ~~Indonesian (ind_Latn)~~
- ~~Icelandic (isl_Latn)~~
- ~~Italian (ita_Latn)~~
- ~~Javanese (jav_Latn)~~
- ~~Japanese (jpn_Jpan)~~
- **Kabyle (kab_Latn)**
- //Jingpho (kac_Latn)// !!no wiki yet!!
- //Kamba (kam_Latn)// !!no wiki yet!!
- ~~Kannada (kan_Knda)~~
- **Kashmiri (Arabic script)** (kas_Arab)
- **Kashmiri (Devanagari script)** (kas_Deva)
- ~~Georgian (kat_Geor)~~
- **Central Kanuri (Arabic script) (knc_Arab)**
- **Central Kanuri (Latin script) (knc_Latn)**
- ~~Kazakh (kaz_Cyrl)~~
- **Kabiyè (kbp_Latn)**
- //Kabuverdianu (kea_Latn)// !!no wiki yet!!
- ~~Khmer (khm_Khmr)~~
- **Kikuyu (kik_Latn)**
- ~~Kinyarwanda (kin_Latn)~~
- ~~Kyrgyz (kir_Cyrl)~~
- //Kimbundu (kmb_Latn)// !!no wiki yet!!
- ~~Northern Kurdish (kmr_Latn)~~
- ~~Kikongo (kon_Latn)~~
- ~~Korean (kor_Hang)~~
- ~~Lao (lao_Laoo)~~
- **Ligurian (lij_Latn)**
- **Limburgish (lim_Latn)**
- ~~Lingala (lin_Latn)~~
- ~~Lithuanian (lit_Latn)~~
- **Lombard (lmo_Latn)**
- **Latgalian (ltg_Latn)**
- ~~Luxembourgish (ltz_Latn)~~
- //Luba-Kasai (lua_Latn)// !!no wiki yet!!
- ~~Ganda (lug_Latn)~~
- //Luo (luo_Latn)// !!no wiki yet!!
- //Mizo (lus_Latn)// !!no wiki yet!!
- ~~Standard Latvian (lvs_Latn)~~
- //Magahi (mag_Deva)// !!no wiki yet!!
- ~~Maithili (mai_Deva)~~
- ~~Malayalam (mal_Mlym)~~
- ~~Marathi (mar_Deva)~~
- **Minangkabau (Arabic script) (min_Arab)**
- **Minangkabau (Latin script) (min_Latn)**
- ~~Macedonian (mkd_Cyrl)~~
- //Plateau Malagasy (plt_Latn)// !!no wiki yet!!
- ~~Maltese (mlt_Latn)~~
- ~~Meitei (Bengali script) (mni_Beng)~~
- //Halh Mongolian (khk_Cyrl)// !!no wiki yet!!
- //Mossi (mos_Latn)// !!no wiki yet!!
- ~~Maori (mri_Latn)~~
- ~~Burmese (mya_Mymr)~~
- ~~Dutch (nld_Latn)~~
- ~~Norwegian Nynorsk (nno_Latn)~~
- ~~Norwegian Bokmål (nob_Latn)~~
- ~~Nepali (npi_Deva)~~
- ~~Northern Sotho (nso_Latn)~~
- //Nuer (nus_Latn)// !!no wiki yet!!
- ~~Nyanja (nya_Latn)~~
- ~~Occitan (oci_Latn)~~
- ~~West Central Oromo (gaz_Latn)~~
- ~~Odia (ory_Orya)~~
- **Pangasinan (pag_Latn)**
- ~~Eastern Panjabi (pan_Guru)~~
- ~~Papiamento (pap_Latn)~~
- //Western Persian (pes_Arab)// !!no wiki yet!!
- ~~Polish (pol_Latn)~~
- ~~Portuguese (por_Latn)~~
- //Dari (prs_Arab)// !!no wiki yet!!
- //Southern Pashto (pbt_Arab)// !!no wiki yet!!
- //Ayacucho Quechua (quy_Latn)// !!no wiki yet!!
- ~~Romanian (ron_Latn)~~
- **Rundi (run_Latn)**
- ~~Russian (rus_Cyrl)~~
- **Sango (sag_Latn)**
- ~~Sanskrit (san_Deva)~~
- **Santali** (sat_Olck)
- **Sicilian (scn_Latn)**
- **Shan (shn_Mymr)**
- ~~Sinhala (sin_Sinh)~~
- ~~Slovak (slk_Latn)~~
- ~~Slovenian (slv_Latn)~~
- ~~Samoan (smo_Latn)~~
- ~~Shona (sna_Latn)~~
- ~~Sindhi (snd_Arab)~~
- ~~Somali (som_Latn)~~
- ~~Southern Sotho (sot_Latn)~~
- ~~Spanish (spa_Latn)~~
- //Tosk Albanian (als_Latn)// !!no wiki yet!!
- **Sardinian (srd_Latn)**
- ~~Serbian (srp_Cyrl)~~
- ~~Swati (ssw_Latn)~~
- ~~Sundanese (sun_Latn)~~
- ~~Swedish (swe_Latn)~~
- ~~Swahili (swh_Latn)~~
- **Silesian (szl_Latn)**
- ~~Tamil (tam_Taml)~~
- ~~Tatar (tat_Cyrl)~~
- ~~Telugu (tel_Telu)~~
- ~~Tajik (tgk_Cyrl)~~
- ~~Tagalog (tgl_Latn)~~
- ~~Thai (tha_Thai)~~
- ~~Tigrinya (tir_Ethi)~~
- //Tamasheq (Latin script) (taq_Latn)// !!no wiki yet!!
- //Tamasheq (Tifinagh script) (taq_Tfng)// !!no wiki yet!!
- **Tok Pisin (tpi_Latn)**
- ~~Tswana (tsn_Latn)~~
- ~~Tsonga (tso_Latn)~~
- ~~Turkmen (tuk_Latn)~~
- **Tumbuka (tum_Latn)**
- ~~Turkish (tur_Latn)~~
- ~~Twi (twi_Latn)~~
- //Central Atlas Tamazight (tzm_Tfng)// !!no wiki yet!!
- ~~Uyghur (uig_Arab)~~
- ~~Ukrainian (ukr_Cyrl)~~
- //Umbundu (umb_Latn)// !!no wiki yet!!
- ~~Urdu (urd_Arab)~~
- //Northern Uzbek (uzn_Latn)// !!no wiki yet!!
- **Venetian (vec_Latn)**
- ~~Vietnamese (vie_Latn)~~
- **Waray (war_Latn)**
- ~~Wolof (wol_Latn)~~
- ~~Xhosa (xho_Latn)~~
- //Eastern Yiddish (ydd_Hebr)// !!no wiki yet!!
- ~~Yoruba (yor_Latn)~~
- ~~Yue Chinese (yue_Hant)~~
- ~~Chinese (Simplified) (zho_Hans)~~
- ~~Chinese (Traditional) (zho_Hant)~~
- //Standard Malay (zsm_Latn)// !!no wiki yet!!
- ~~Zulu (zul_Latn)~~