Description

Comments added to LanguagePicker.js say that it tries to "latinize" some content based on script suffixes like _rm or _pinyin if requested language uses the same script. The way it actutally works is currently problematic in the following aspects:

It picks Latin-suffix name for non-Latin languages (T208927).
It picks codes like "sr-Latn" that is often associated to name variant that doesn't apply to any other Latin-script language. It does so even if local name (associated to OSM "name" key) already is in Latin script (T195318, T229516).
It doesn't pick "zh_pinyin" name for Latin-script languages, e.g. it's currently provided for Tongliao label node, but Dutch-language tile still displays non-Latin name.

These script suffixes should probably be ignored for latinization purpose outside relevant region. There are currently e.g. 476 uses of "sr-Latn" outside Serbia and 356 uses of "zh_pinyin" outside China/Taiwan that are probably relevant to only Serbian and Chinese itself.

If it would be possible for LanguagePicker to actually differentiate between languages by script and also the region of given name, then picking certain codes like "zh_pinyin", "ja_rm", "ko-Latn" for Latin-script languages is probably appropriate. If it isn't easy to achieve then for a start it might be more appropriate to ignore script suffixes.

Tasks mentioned above cover particular cases where wrong name is displayed. This task intends summarize the underlying issue.

Related Objects
Search...

Status	Subtype	Assigned	Task
Open		None	T230013 LanguagePicker's handling of script suffixes is broken
Open		None	T195318 Suffixed keys like "name:sr-Latn", specific to one language, are used to latinize other languages
Resolved		None	T208927 Label lang error in map
Resolved	BUG REPORT	None	T229516 OSM maps in infoboxes on English wikipedia show wrong spelling "San Huan" of "San Juan"

Event Timeline

Pikne created this task.Aug 7 2019, 11:43 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptAug 7 2019, 11:43 AM

Pikne added subtasks: T195318: Suffixed keys like "name:sr-Latn", specific to one language, are used to latinize other languages, T208927: Label lang error in map, T229516: OSM maps in infoboxes on English wikipedia show wrong spelling "San Huan" of "San Juan" .Aug 7 2019, 2:07 PM

Haros subscribed.Aug 7 2019, 3:13 PM

Pikne updated the task description. (Show Details)Aug 29 2019, 12:07 PM

Restricted Application added a subscriber: • Petar.petkovic. · View Herald TranscriptAug 29 2019, 12:07 PM

Pikne closed subtask T208927: Label lang error in map as Resolved.Jan 18 2020, 1:31 PM

Nemo_bis added a project: I18n.Jan 26 2020, 10:31 AM

Pikne mentioned this in T195318: Suffixed keys like "name:sr-Latn", specific to one language, are used to latinize other languages.Jan 26 2020, 11:42 AM

Pikne mentioned this in T260456: [Maps] Reduce Map Sync Latency with OpenStreetMaps (OSM).Apr 1 2021, 3:50 PM

Izno subscribed.Apr 1 2021, 4:22 PM

Firefly subscribed.Apr 1 2021, 4:23 PM

Thryduulf subscribed.Apr 1 2021, 5:12 PM

Izno mentioned this in T285059: ptwiki map shows Serbian label for Portugal.Jun 16 2021, 4:18 PM

Pikne added a subtask: T285059: ptwiki map shows Serbian label for Portugal.Jun 16 2021, 6:09 PM

Aklapper removed a subtask: T285059: ptwiki map shows Serbian label for Portugal.Aug 20 2021, 9:25 AM

Aklapper added a parent task: T285059: ptwiki map shows Serbian label for Portugal.

Pikne removed a parent task: T285059: ptwiki map shows Serbian label for Portugal.Sep 14 2021, 7:38 AM

Pikne mentioned this in T305452: Investigate Kartographer transliteration and translation completeness.Apr 5 2022, 11:11 AM

TheDJ subscribed.Jun 16 2022, 8:18 AM

This problem is becoming worse and worse over time. Major place names in Manhattan are almost entirely Serbian at this point because name:sr-Latn is prioritized higher than the default name (which is English). The latinization code should probably not do anything if the default text is already entirely Latin characters.

Screenshot 2024-05-07 at 17.36.32.png (1×1 px, 2 MB)

Can we please get this ticket triaged and assigned?

AntiCompositeNumber added a project: Content-Transform-Team.May 8 2024, 3:44 PM

Riblet15 subscribed.May 10 2024, 6:26 AM

AntiCompositeNumber subscribed.May 10 2024, 10:22 PM

MSantos added a project: Essential-Work.May 16 2024, 2:31 PM

ClydeFranklin subscribed.Tue, May 21, 1:26 AM

MSantos moved this task from Backlog to Later on the Content-Transform-Team board.Thu, May 23, 2:16 PM

	F50847474: Screenshot 2024-05-07 at 17.36.32.png
	May 8 2024, 1:26 AM

LanguagePicker's handling of script suffixes is brokenOpen, Needs TriagePublicActions

Description

Related ObjectsSearch...

Event Timeline

LanguagePicker's handling of script suffixes is broken
Open, Needs TriagePublic
Actions

Related Objects
Search...