The [NLLB-200 model](https://ai.facebook.com/research/no-language-left-behind/) supports many languages. However, [the initial integration in Content Translation](https://www.mediawiki.org/wiki/Content_translation/Machine_Translation/NLLB-200) supports [a smaller set of 23 languages](https://phabricator.wikimedia.org/diffusion/GCXS/browse/master/config/Flores.yaml). In order to improve machine translation support, we want to support languages lack currently machine translation support by [the current services available](https://www.mediawiki.org/wiki/Help:Content_translation/Translating/Initial_machine_translation#Machine_translation_availability) but could be provided using NLLB-200.
**Candidate languages identified so far**
These languages have no current MT support enabled, but NLLB-200 could provide it :
- Santali (sat/sat_Olck) ( [view request](https://www.mediawiki.org/wiki/Topic:Wz6l388gvokhurgt) )
- Kashmiri (ks/kas_Arab/kas_Deva) ( view [request](https://www.mediawiki.org/wiki/Topic:Wx8zsebwlellf4el) and T326541 )
- Egyptian Arabic (arz/arz_Arab)
- South Azerbaijani (azb/azb_Arab)
- Balinese (ban/ban_Latn)
- Banjar (bjn/bjn_Arab/bjn_Latn)
- Tibetan (bo/bod_Tibt)
- Buginese (bug/bug_Latn)
- Crimean Tatar (crh/crh_Latn)
- Dzongkha (dz/dzo_Tibt)
- Faroese (fo/fao_Latn)
- Fijian (fj/fij_Latn)
- Fon (fon/fon_Latn) (as it graduates from incubator)
- Friulian (fur/fur_Latn)
- Fulah/Nigerian Fulfulde (ff/fuv_Latn)
**Languages listed on the NLLB-200 documentation**
These languages are [supported by the NLLB-200 model](https://github.com/facebookresearch/flores/blob/main/flores200/README.md).
!!Work in progress: marking **in bold** those wthout MT, and ~~striked~~ those with MT existing MT support. !!
- Acehnese (Arabic script) (ace_Arab)
- Acehnese (Latin script) (ace_Latn)
- Mesopotamian Arabic (acm_Arab)
- Ta’izzi-Adeni Arabic (acq_Arab)
- Tunisian Arabic (aeb_Arab)
- ~~Afrikaans (afr_Latn)~~
- South Levantine Arabic (ajp_Arab)
- Akan (aka_Latn)
- ~~Amharic (amh_Ethi)~~
- North Levantine Arabic (apc_Arab)
- Modern Standard Arabic (arb_Arab)
- Modern Standard Arabic (Romanized) (arb_Latn)
- Najdi Arabic (ars_Arab)
- Moroccan Arabic (ary_Arab)
- **Egyptian Arabic (arz_Arab)**
- ~~Assamese (asm_Beng)~~
- ~~Asturian (ast_Latn)~~
- Awadhi (awa_Deva)
- ~~Central Aymara (ayr_Latn)~~
- **South Azerbaijani (azb_Arab)**
- North Azerbaijani (azj_Latn)
- ~~Bashkir (bak_Cyrl)~~
- ~~Bambara (bam_Latn)~~
- **Balinese (ban_Latn)**
- ~~Belarusian (bel_Cyrl)~~
- //Bemba (bem_Latn)// !!no wiki yet!!
- ~~Bengali (ben_Beng)~~
- ~~Bhojpuri (bho_Deva)~~
- **Banjar (Arabic script) (bjn_Arab)**
- **Banjar (Latin script) (bjn_Latn)**
- **Standard Tibetan (bod_Tibt)**
- ~~Bosnian (bos_Latn)~~
- **Buginese (bug_Latn)**
- ~~Bulgarian (bul_Cyrl)~~
- ~~Catalan (cat_Latn)~~
- ~~Cebuano (ceb_Latn)~~
- ~~Czech (ces_Latn)~~
- //Chokwe (cjk_Latn)// !!no wiki yet!!
- ~~Central Kurdish (ckb_Arab)~~
- **Crimean Tatar (crh_Latn)**
- ~~Welsh (cym_Latn)~~
- ~~Danish (dan_Latn)~~
- ~~German~~ (deu_Latn)
- Southwestern Dinka (dik_Latn)
- Dyula (dyu_Latn)
- **Dzongkha (dzo_Tibt)**
- ~~Greek~~ (ell_Grek)
- ~~English~~ (eng_Latn)
- ~~Esperanto~~ (epo_Latn)
- ~~Estonian~~ (est_Latn)
- ~~Basque~~ (eus_Latn)
- ~~Ewe (ewe_Latn)~~
- **Faroese (fao_Latn)**
- **Fijian (fij_Latn)**
- ~~Finnish (fin_Latn)~~
- //Fon (fon_Latn)// !!wiki still in incubator!!
- ~~French (fra_Latn)~~
- **Friulian (fur_Latn)**
- **Nigerian Fulfulde (fuv_Latn)**
- ~~Scottish Gaelic (gla_Latn)~~
- ~~Irish (gle_Latn)~~
- ~~Galician (glg_Latn)~~
- ~~Guarani (grn_Latn)~~
- ~~Gujarati (guj_Gujr)~~
- ~~Haitian Creole (hat_Latn)~~
- ~~Hausa (hau_Latn)~~
- ~~Hebrew (heb_Hebr)~~
- ~~Hindi (hin_Deva)~~
- Chhattisgarhi (hne_Deva)
- Croatian (hrv_Latn)
- Hungarian (hun_Latn)
- Armenian (hye_Armn)
- Igbo (ibo_Latn)
- Ilocano (ilo_Latn)
- Indonesian (ind_Latn)
- Icelandic (isl_Latn)
- Italian (ita_Latn)
- Javanese (jav_Latn)
- Japanese (jpn_Jpan)
- Kabyle (kab_Latn)
- Jingpho (kac_Latn)
- Kamba (kam_Latn)
- Kannada (kan_Knda)
- **Kashmiri (Arabic script)** (kas_Arab)
- **Kashmiri (Devanagari script)** (kas_Deva)
- Georgian (kat_Geor)
- Central Kanuri (Arabic script) (knc_Arab)
- Central Kanuri (Latin script) (knc_Latn)
- Kazakh (kaz_Cyrl)
- Kabiyè (kbp_Latn)
- Kabuverdianu (kea_Latn)
- Khmer (khm_Khmr)
- Kikuyu (kik_Latn)
- Kinyarwanda (kin_Latn)
- Kyrgyz (kir_Cyrl)
- Kimbundu (kmb_Latn)
- Northern Kurdish (kmr_Latn)
- Kikongo (kon_Latn)
- Korean (kor_Hang)
- Lao (lao_Laoo)
- Ligurian (lij_Latn)
- Limburgish (lim_Latn)
- Lingala (lin_Latn)
- Lithuanian (lit_Latn)
- Lombard (lmo_Latn)
- Latgalian (ltg_Latn)
- Luxembourgish (ltz_Latn)
- Luba-Kasai (lua_Latn)
- Ganda (lug_Latn)
- Luo (luo_Latn)
- Mizo (lus_Latn)
- Standard Latvian (lvs_Latn)
- Magahi (mag_Deva)
- Maithili (mai_Deva)
- Malayalam (mal_Mlym)
- Marathi (mar_Deva)
- Minangkabau (Arabic script) (min_Arab)
- Minangkabau (Latin script) (min_Latn)
- Macedonian (mkd_Cyrl)
- Plateau Malagasy (plt_Latn)
- Maltese (mlt_Latn)
- Meitei (Bengali script) (mni_Beng)
- Halh Mongolian (khk_Cyrl)
- Mossi (mos_Latn)
- Maori (mri_Latn)
- Burmese (mya_Mymr)
- Dutch (nld_Latn)
- Norwegian Nynorsk (nno_Latn)
- Norwegian Bokmål (nob_Latn)
- Nepali (npi_Deva)
- Northern Sotho (nso_Latn)
- Nuer (nus_Latn)
- Nyanja (nya_Latn)
- Occitan (oci_Latn)
- West Central Oromo (gaz_Latn)
- Odia (ory_Orya)
- Pangasinan (pag_Latn)
- Eastern Panjabi (pan_Guru)
- Papiamento (pap_Latn)
- Western Persian (pes_Arab)
- Polish (pol_Latn)
- Portuguese (por_Latn)
- Dari (prs_Arab)
- Southern Pashto (pbt_Arab)
- Ayacucho Quechua (quy_Latn)
- Romanian (ron_Latn)
- Rundi (run_Latn)
- Russian (rus_Cyrl)
- Sango (sag_Latn)
- Sanskrit (san_Deva)
- **Santali** (sat_Olck)
- Sicilian (scn_Latn)
- Shan (shn_Mymr)
- Sinhala (sin_Sinh)
- Slovak (slk_Latn)
- Slovenian (slv_Latn)
- Samoan (smo_Latn)
- Shona (sna_Latn)
- Sindhi (snd_Arab)
- Somali (som_Latn)
- Southern Sotho (sot_Latn)
- Spanish (spa_Latn)
- Tosk Albanian (als_Latn)
- Sardinian (srd_Latn)
- Serbian (srp_Cyrl)
- Swati (ssw_Latn)
- Sundanese (sun_Latn)
- Swedish (swe_Latn)
- Swahili (swh_Latn)
- Silesian (szl_Latn)
- Tamil (tam_Taml)
- Tatar (tat_Cyrl)
- Telugu (tel_Telu)
- Tajik (tgk_Cyrl)
- Tagalog (tgl_Latn)
- Thai (tha_Thai)
- Tigrinya (tir_Ethi)
- Tamasheq (Latin script) (taq_Latn)
- Tamasheq (Tifinagh script) (taq_Tfng)
- Tok Pisin (tpi_Latn)
- Tswana (tsn_Latn)
- Tsonga (tso_Latn)
- Turkmen (tuk_Latn)
- Tumbuka (tum_Latn)
- Turkish (tur_Latn)
- Twi (twi_Latn)
- Central Atlas Tamazight (tzm_Tfng)
- Uyghur (uig_Arab)
- Ukrainian (ukr_Cyrl)
- Umbundu (umb_Latn)
- Urdu (urd_Arab)
- Northern Uzbek (uzn_Latn)
- Venetian (vec_Latn)
- Vietnamese (vie_Latn)
- Waray (war_Latn)
- Wolof (wol_Latn)
- Xhosa (xho_Latn)
- Eastern Yiddish (ydd_Hebr)
- Yoruba (yor_Latn)
- Yue Chinese (yue_Hant)
- Chinese (Simplified) (zho_Hans)
- Chinese (Traditional) (zho_Hant)
- Standard Malay (zsm_Latn)
- Zulu (zul_Latn)