Page MenuHomePhabricator

Update MediaWiki to use Unicode 8.0.0 (utfnormal, Scribunto, etc.)
Closed, ResolvedPublic

Description

Wikimedia is updating to libicu57, so we should update the version of Unicode used in utfnormal and Scribunto to match. These patches should be deployed after libicu finishes being upgraded.

Event Timeline

We already updated css-sanitizer for the new utfnormal version, but it looks like we also need to update RemexHtml and Equivset: https://packagist.org/packages/wikimedia/utfnormal/dependents

As a side note, the Unicode situation in our environment is complicated:

  • libucu57, and thus PHP's intl extension, will use Unicode 8.0.0.
  • But the Unicode support in PCRE functions is not tied to that. The current version of libpcre on terbium is 8.35, while stretch has 8.39. The PCRE changelog entries mentioning Unicode version updates indicate that these use 6.3.0 and 7.0.0, respectively.
  • The mbstring library in HHVM uses a really old version of Unicode, maybe 3.2 assuming they copied it from Zend PHP before this commit. PHP7.0's mbstring uses Unicode 8.0.0, and 7.2 uses Unicode 10.

Change 426849 had a related patch set uploaded (by Legoktm; owner: Legoktm):
[mediawiki/vendor@master] Update wikimedia/utfnormal to 2.0.0

https://gerrit.wikimedia.org/r/426849

Change 426850 had a related patch set uploaded (by Legoktm; owner: Legoktm):
[mediawiki/core@master] Update wikimedia/utfnormal to 2.0.0

https://gerrit.wikimedia.org/r/426850

Change 426849 merged by jenkins-bot:
[mediawiki/vendor@master] Update wikimedia/utfnormal to 2.0.0

https://gerrit.wikimedia.org/r/426849

Change 426850 merged by jenkins-bot:
[mediawiki/core@master] Update wikimedia/utfnormal to 2.0.0

https://gerrit.wikimedia.org/r/426850

Legoktm claimed this task.