Page MenuHomePhabricator

Add bfw, gju, hoc and kgg to language names
Open, Needs TriagePublicBUG REPORT

Description

Steps to replicate the issue (include links if applicable):

  • Go to a video file on Commons (example) and click on TimedText option on top left.
  • Search for language codes bfw, gju, hoc or kgg from dropdown

What happens?:
The language codes don't appear

What should have happened instead?:
The language codes should appear just like en - English or fr - français. The languages for which the code should be added are:

'bfw' => 'Bonda/Remosam/ବଣ୍ଡା',
'gju' => 'गुज्जरी/Gujari/Gojri',
'hoc' => 'Ho/𑢹𑣉𑣉 𑣎𑣋𑣜',
'kgg' => 'Kusunda/Gemehaq gipan'

Software version (on Special:Version page; skip for WMF-hosted wikis like Wikipedia):

Other information (browser name/version, screenshots, etc.):

Event Timeline

jhsoby subscribed.

This is not a request for a new language for translation though. @psubhashish1 wants to add (and presumably show) subtitles in these languages which aren't supported. He shouldn't be required to translate (or find someone to translate) the MediaWiki interface in order to do that.

FWIW, Subhashish, you can already save the TimedText in these languages – just follow the procedure to add TimedText as you normally would (but choosing another language), and when you get to the TimedText page, you replace the language code in the URL with one of these language codes. There is, however, no way (that I've found) to actually display those subtitles yet, so the usability for anything other than archiving purposes is limited.

I made a couple of tests just now:

The difference between the two is that in InitialiseSettings.php, rmf has been added to wmgExtraLanguageNames for Commons, while there's no such config for Beta Commons. So the way to solve this ticket would be adding the four requested lanugages to wmgExtraLanguageNames for Commons (and Wikidata, since it's better to keep the two lists in sync).

@psubhashish1 We need the autonyms to add them to that config. You submitted two or three names per language in the task description – can you clarify which ones are the autonyms that should be used?

jhsoby renamed this task from Add languages with ISO codes to Add bfw, gju, hoc and kgg to language names.Wed, Dec 11, 10:12 AM

Thanks for the clear guidance, @jhsoby.

I indeed had created the Gujari subs in a similar way you mentioned (creating subs from any language and editing the language code from URL) -- see this. However, it doesn't appear in the video CC option. That's the biggest problem.

Can we have hyphenated pseudonyms for names? In the list below, the first one is the common name, the second is the endonym, and the third is the name in the native script. In the absence of a third name, the second one is the name in native language/script.

'bfw' => 'Bonda - Remosam - ବଣ୍ଡା',
'gju' => 'Gujari - گوجری - गुज्जरी',
'hoc' => 'Ho - 𑢹𑣉𑣉 𑣎𑣋𑣜',
'kgg' => 'Kusunda - Gemehaq gipan'

Note that Gujari is most often written in the Arabic script (گوجری).

@mrephabricator, Gujari is actually written in multiple scripts. Most Gujari speakers in India are in states where Hindi is a dominant and official language, and in Nepal where Nepali (written in Devanagari) is the national language. Gujari is predominately an oral language and is not taught widely in schools or is used in media or publications. So, those residing in India and Nepal learn Devanagari due to school education and use it for Gujari when they need to. Van Gujjari, a variant of Gujjari, uses Devanagari for this reason even though all Van Gujjars are Muslims who would have preferred Perso-Arabic over Devanagari. Over 174 books are translated and several books are published in Van Gujjari and these are in Devanagari. Gujjars living in Pakistan and Afghanistan would use Perso-Arabic. For that reason, keeping the name in both Perso-Arabic and Devanagari might be helpful (I've edited my earlier comment). Thanks for suggesting this.

Can we have hyphenated pseudonyms for names?

No, not really, that would be breaking with the standard we've decided on for these lists. The names already in the list follow the same format as MediaWiki core's Names.php. So we can't really have names that are in a different language than the language itself.

In the list below, the first one is the common name, the second is the endonym, and the third is the name in the native script. In the absence of a third name, the second one is the name in native language/script.

In case the language is written in more than one script, we can add codes for each script. So since Gujari is written (when it is written at all) both with Arabic and Devanagari scripts, we can add 'gju-arab' => 'گوجری', and 'gju-deva' => 'गुज्जरी',.

Ok, we can have just the following for the time being:

'bfw' => 'Bonda',
'gju' => 'Gujari',
'hoc' => 'Ho',
'kgg' => 'Kusunda'
'gju-arab' => 'گوجری',
'gju-deva' => 'गुज्जरी',