A number of language codes currently don't display an English name when used on Wikidata, e.g. many of the examples on this page.
Could the following English names be added?
- abe: "Western Abenaki"
- ady-cyrl: "Adyghe (Cyrillic script)"
- aeb-arab: "Tunisian Arabic (Arabic script)"
- aeb-latn: "Tunisian Arabic (Latin script)"
- azb: "South Azerbaijani"
- bxr: "Buryat" [added as "Russia Buriat"]
- dty: "Doteli"
- ett: "Etruscan"
- fkv: "Kven"
- kbd-cyrl: "Kabardian (Cyrillic script)"
- ko-kp: "Korean (North Korea)"
- koy: "Koyukon"
- ku-arab: "Kurdish (Arabic script)"
- lbe: "Lak"
- lld: "Ladin"
- mo: "Moldovan"
- moe: "Montagnais"
- nl-informal: "Dutch (informal address)"
- nod: "Northern Thai"
- nys: "Noongar"
- otk: "Old Turkish"
- roa-tara: "Tarantino"
- rwr: "Marwari (India)"
- shi-latn: "Tachelhit (Latin script)"
- shi-tfng: "Tachelhit (Tifinagh script)"
- sje: "Pite Sami"
- tzl: "Talossan"
- zh-mo: "Chinese (Macau)"
- zh-my: "Chinese (Malaysia)"
The special code "mis" is also missing an English name. ISO 639-3 (http://www-01.sil.org/iso639-3/documentation.asp?id=mis) calls it "Uncoded languages", but something like "other language" or "unsupported language" might better fit the way it's used in Wikidata.
Also, while I'm requesting updates, I think the following three should be changed:
- bbc-latn: Change "Batak Toba" to "Batak Toba (Latin script)"
- gan-hans: Change "Simplified Gan script" to "Gan (Simplified)"
- gan-hant: Change "Traditional Gan script" to "Gan (Traditional)"
"Batak Toba" is currently used as the name for both bbc and bbc-latn so they aren't distinguishable. All other -latn codes include "(Latin script)" in the name.
For Gan, the current names sound really odd (the phrasing "... script" is used for script names, not languages). We normally put script information in brackets after the language name, so that's what I've suggested here. It would also match the way cjy-hans/cjy-hant are named in this file.
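In case it helps whoever picks this up, here is a rough sketch of how the two problems above (duplicate names, and script variants whose name lacks the bracketed script) could be checked automatically. It is only an illustration: the sample entries are copied from this request, while `CURRENT_NAMES`, `SCRIPT_LABELS`, and both function names are made up for the example and do not reflect the actual file format.

```python
from collections import defaultdict

# Sample entries copied from this request. The real name file is much
# larger, and its format is not assumed here.
CURRENT_NAMES = {
    "bbc": "Batak Toba",
    "bbc-latn": "Batak Toba",
    "gan-hans": "Simplified Gan script",
    "gan-hant": "Traditional Gan script",
}

# Hypothetical mapping from a script subtag to the bracketed label
# used in the names proposed in this request.
SCRIPT_LABELS = {
    "latn": "(Latin script)",
    "cyrl": "(Cyrillic script)",
    "arab": "(Arabic script)",
    "tfng": "(Tifinagh script)",
    "hans": "(Simplified)",
    "hant": "(Traditional)",
}


def duplicate_names(names):
    """Return {name: [codes]} for every name shared by more than one code."""
    by_name = defaultdict(list)
    for code, name in names.items():
        by_name[name].append(code)
    return {n: sorted(c) for n, c in by_name.items() if len(c) > 1}


def missing_script_label(names):
    """Return codes with a known script subtag whose name lacks its label."""
    flagged = []
    for code, name in names.items():
        if "-" not in code:
            continue  # plain language code, no script subtag to check
        label = SCRIPT_LABELS.get(code.rsplit("-", 1)[-1])
        if label is not None and not name.endswith(label):
            flagged.append(code)
    return sorted(flagged)


print(duplicate_names(CURRENT_NAMES))       # {'Batak Toba': ['bbc', 'bbc-latn']}
print(missing_script_label(CURRENT_NAMES))  # ['bbc-latn', 'gan-hans', 'gan-hant']
```

With the three renames suggested above applied, both checks come back empty for these sample entries.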