ICU orders characters by script group (Latin/Hani) first, but we want to avoid having first-letter sections separately for Pinyin and English words on different pages of a large category.
Description
Description
Details
Details
Related Changes in Gerrit:
| Subject | Repo | Branch | Lines +/- | |
|---|---|---|---|---|
| Collation: Introduce a tailored collation for Chinese Pinyin sorting | mediawiki/core | master | +149 -13 |
Event Timeline
Comment Actions
Change #1241256 had a related patch set uploaded (by Func; author: Func):
[mediawiki/core@master] IcuCollation: Group Pinyin initials and Latin characters into buckets
Comment Actions
How will manual pinyin overrides (if that exists) for characters not pronouncing as usual behave under this patch?
Comment Actions
This patch did not change the behaviour for manual overrides, so they would be sorted as English words as previously. The issue will be investigated and improved as part of T401456, or you may file a subtask specifically for in-page manual overrides. I have some WIP on T401456, may post my findings in a few days.
Comment Actions
Change #1241256 merged by jenkins-bot:
[mediawiki/core@master] Collation: Introduce a tailored collation for Chinese Pinyin sorting

