Page MenuHomePhabricator

Languages with redirects appear duplicated in the ULS panel
Open, MediumPublic

Description

If you are interested in working on this task, you can create a subtask for the language wiki (see the checklist in the task description below) that you plan to work on and assign the task to yourself.

Description
Some languages with variants and redirects can appear in the Universal Language Selector (ULS) twice. In lieu of fixing T372034, it seems that not only is Punjabi (Gurmukhi) appearing twice, but there are also around 30 other languages with redirects showing up twice as well.

For Punjabi, for example, in data/langdb.yaml and data/language-data.json, pa-guru was made a redirect for pa instead of the other way around. And language codes, if any, were reversed in tests/js/index.js.

Example fixes:

This process needs to be repeated for all affected languages.

Steps to follow:

  • Update language-data:
    • Fix redirects in langdb.yaml and language-data.json
    • Check and reverse language code if needed in tests/js/index.js
  • Update jquery.uls with the new language data from upstream
#Languages to check and fixAssigneeStatus
1адыгабзэ (ady-cyrl/ady)
2تونسي / Tûnsî (aeb/aeb-arab)@srishakatuxT396082
3azərbaycanca (az/az-latn)@NemoralisT394873
4تۆرکجه (azb/az-arab)@NemoralisT394873
5Batak Toba (bbc/bbc-latn)
6भोजपुरी (bh/bho)
7Chavacano de Zamboanga (cbk-zam/cbk)
8qırımtatarca (Latin) (crh-latn/crh)
9गोंयची कोंकणी / Gõychi Konknni (gom/gom-deva)
10Fiji Hindi (hif-latn/hif)
11ᐃᓄᒃᑎᑐᑦ / inuktitut (iu/ike-cans)
12адыгэбзэ (kbd-cyrl/kbd)
13қазақша (kk/kk-cyrl)
14قازاقشا (جۇنگو) (kk-cn/kk-arab)
15қазақша (Қазақстан) (kk-kz/kk-cyrl)
16qazaqşa (Türkïya) (kk-tr/kk-latn)
17kurdî (ku/ku-latn)
18Ladino (lad/lad-latn)
19ਪੰਜਾਬੀ (pa/pa-guru)@srishakatuxT372034
20Влахесте (ruq-cyrl/ruq)
21Taclḥit (shi/shi-latn)
22tacawit (shy/shy-latn)
23سرائیکی (skr/skr-arab)
24српски / srpski (sr/sr-cyrl)
25српски (ћирилица) (sr-ec/sr-cyrl)
26srpski (latinica) (sr-el/sr-latn)
27тоҷикӣ (tg/tg-cyrl)@srishakatux, @Amire80411, 413 (on GitHub) ✅
28татарча (tt-cyrl/tt)
29ئۇيغۇرچە / Uyghurche (ug/ug-arab)
30कॉशुर / کٲشُر (ks/ks-arab)

Event Timeline

Vvjjkkii renamed this task from Languages with redirects appear duplicated in the ULS panel to 5gcaaaaaaa.Jul 1 2018, 1:08 AM
Vvjjkkii triaged this task as High priority.
Vvjjkkii updated the task description. (Show Details)
Vvjjkkii removed a subscriber: Aklapper.
CommunityTechBot renamed this task from 5gcaaaaaaa to Languages with redirects appear duplicated in the ULS panel.Jul 2 2018, 3:57 PM
CommunityTechBot raised the priority of this task from High to Needs Triage.
CommunityTechBot updated the task description. (Show Details)
CommunityTechBot added a subscriber: Aklapper.

In lieu of fixing T372034, it seems that not only is Punjabi (Gurmukhi) appearing twice in the Universal Language Selector, but there are also around 30 other languages with redirects appearing twice as well. 14 of these languages are also included in @Amire80's list above.

  1. адыгабзэ (ady-cyrl/ady)
  2. تونسي / Tûnsî (aeb/aeb-arab)
  3. azərbaycanca (az/az-latn)
  4. تۆرکجه (azb/az-arab)
  5. Batak Toba (bbc/bbc-latn)
  6. भोजपुरी (bh/bho)
  7. Chavacano de Zamboanga (cbk-zam/cbk)
  8. qırımtatarca (Latin) (crh-latn/crh)
  9. गोंयची कोंकणी / Gõychi Konknni (gom/gom-deva)
  10. Fiji Hindi (hif-latn/hif)
  11. ᐃᓄᒃᑎᑐᑦ / inuktitut (iu/ike-cans)
  12. адыгэбзэ (kbd-cyrl/kbd)
  13. қазақша (kk/kk-cyrl)
  14. قازاقشا (جۇنگو) (kk-cn/kk-arab)
  15. қазақша (Қазақстан) (kk-kz/kk-cyrl)
  16. qazaqşa (Türkïya) (kk-tr/kk-latn)
  17. कॉशुर / کٲشُر (ks/ks-arab)
  18. kurdî (ku/ku-latn)
  19. Ladino (lad/lad-latn)
  20. ਪੰਜਾਬੀ (pa/pa-guru)
  21. Влахесте (ruq-cyrl/ruq)
  22. Taclḥit (shi/shi-latn)
  23. tacawit (shy/shy-latn)
  24. سرائیکی (skr/skr-arab)
  25. српски / srpski (sr/sr-cyrl)
  26. српски (ћирилица) (sr-ec/sr-cyrl)
  27. srpski (latinica) (sr-el/sr-latn)
  28. тоҷикӣ (tg/tg-cyrl)
  29. татарча (tt-cyrl/tt)
  30. ئۇيغۇرچە / Uyghurche (ug/ug-arab)

@abi_ @santhosh Could we get one of you to help review the patch (linked in a comment above)?

Following what we did for Punjabi to redirect pa-guru to pa here T372034, here is a first patch for this task which addresses an issue with data formatting for Tunisian Arabic:
https://github.com/wikimedia/language-data/pull/403.

If this patch is okay, we can repeat the same for languages listed in T195342#10123652.

Change #1121436 had a related patch set uploaded (by Srishakatux; author: Srishakatux):

[mediawiki/extensions/UniversalLanguageSelector@master] Update `jquery.uls` from upstream

https://gerrit.wikimedia.org/r/1121436

Change #1121436 merged by jenkins-bot:

[mediawiki/extensions/UniversalLanguageSelector@master] Update `jquery.uls` from upstream

https://gerrit.wikimedia.org/r/1121436

MaryMunyoki changed the task status from Open to In Progress.Feb 25 2025, 2:35 PM
srishakatux raised the priority of this task from Low to Medium.May 9 2025, 1:06 AM
srishakatux updated the task description. (Show Details)

Thank you for tagging this task with good first task for Wikimedia newcomers!

Newcomers often may not be aware of things that may seem obvious to seasoned contributors, so please take a moment to reflect on how this task might look to somebody who has never contributed to Wikimedia projects.

A good first task is a self-contained, non-controversial task with a clear approach. It should be well-described with pointers to help a completely new contributor, for example it should clearly pointed to the codebase URL and provide clear steps to help a contributor get setup for success. We've included some guidelines at https://phabricator.wikimedia.org/tag/good_first_task/ !

Thank you for helping us drive new contributions to our projects <3

Thanks for claiming this task, @Nemoralis!

I also wanted to remind everyone that, since this task involves making changes for 30+ languages, multiple people can work on it together :) I see that @sathvik9105 and @Adityakumar0545 have expressed interest as well.

srishakatux updated the task description. (Show Details)
srishakatux added a subscriber: Nemoralis.
srishakatux changed the task status from In Progress to Open.May 21 2025, 12:03 AM
srishakatux updated the task description. (Show Details)

Just like with az and azb T394873, it appears that aeb T396082 may also have another issue causing the language to still appear twice in ULS. Previously, it was observed that redirects can sometimes cause duplicates in the Suggested Languages list in ULS, as was the case with az. For aeb, it could be something else, for example, a redirect added in Names.php in MediaWiki core. Although T396082 is being closed for now, as the redirect has been fixed in the language data repository, something broader might still be causing the duplicate display. Currently, aeb only appears when you search using Tunisian arabic rather than the language code aeb itself.