
TextMatch: implement i18n-aware case preservation
Closed, Resolved, Public

Description

Without case preservation, replacements would tend to break uppercased and titlecased text.

For example, a rule like “analysing -> analyzing” would turn “Analysing results” into “analyzing results” rather than “Analyzing results”.

The case preservation needs to:

  • work in non-Latin scripts
  • support accented characters
  • not lowercase replacement terms with fixed uppercasing (such as SMS, France, or McFly)
  • support language-specific uppercasing rules (e.g. Turkish dotless I and dotted İ)
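
The requirements above can be illustrated with a minimal sketch. This is not the merged VisualEditor code: the function name `preserveCase`, its parameters, and the three-way classification (all caps, title case, otherwise unchanged) are assumptions made for illustration. It relies only on the standard `String.prototype.toLocaleUpperCase` and `toLocaleLowerCase` methods, which accept a locale tag.

```javascript
// Hypothetical helper for illustration only; not the VisualEditor implementation.
// matched:     the text found in the document, e.g. "Analysing"
// replacement: the rule's replacement term, e.g. "analyzing"
// locale:      optional BCP 47 tag, e.g. "tr" for Turkish casing rules
function preserveCase( matched, replacement, locale ) {
	const lower = ( s ) => s.toLocaleLowerCase( locale );
	const upper = ( s ) => s.toLocaleUpperCase( locale );

	// A replacement that carries its own capitals (SMS, France, McFly)
	// has fixed casing, so return it untouched.
	if ( replacement !== lower( replacement ) ) {
		return replacement;
	}

	// Split by code point so characters outside the BMP stay intact.
	const [ first = '', ...restChars ] = Array.from( matched );
	const rest = restChars.join( '' );

	// All caps: at least two characters, unchanged by uppercasing, and the
	// tail contains cased letters. Caseless scripts fail the last check.
	if ( rest !== '' && matched === upper( matched ) && rest !== lower( rest ) ) {
		return upper( replacement );
	}

	// Title case: first character is uppercase and the tail is lowercase.
	if ( first !== lower( first ) && rest === lower( rest ) ) {
		const [ rFirst = '', ...rRest ] = Array.from( replacement );
		return upper( rFirst ) + rRest.join( '' );
	}

	// Lowercase, mixed case, or caseless text: use the replacement as written.
	return replacement;
}

console.log( preserveCase( 'Analysing', 'analyzing' ) ); // Analyzing
console.log( preserveCase( 'ANALYSING', 'analyzing' ) ); // ANALYZING
console.log( preserveCase( 'Plain', 'éclair' ) ); // Éclair
console.log( preserveCase( 'TEXT', 'SMS' ) ); // SMS
```

Because casing is delegated to the locale-aware string methods, passing a locale tag such as `tr` selects Turkish rules (i uppercases to İ) in environments that ship full ICU data. Mixed-case matches are deliberately left alone in this sketch; a real implementation may choose differently.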

Event Timeline

Change #1279349 had a related patch set uploaded (by Divec; author: Divec):

[mediawiki/extensions/VisualEditor@master] TextMatch: implement i18n-aware case preservation

https://gerrit.wikimedia.org/r/1279349

Change #1279349 merged by jenkins-bot:

[mediawiki/extensions/VisualEditor@master] TextMatch: implement i18n-aware case preservation

https://gerrit.wikimedia.org/r/1279349

Tested with replacement rules containing accented characters:

"naive": "naïve",
"fööbar": "qüux",
"wîbble": "wôbble",
"frobnicate": "frob",
"qux": "xyzzy",
"plain": "éclair"