Page MenuHomePhabricator

mwdumper still has UTF-8 problems with Xerces
Closed, ResolvedPublic

Description

From GitHub comment:

This still isn't fully resolved from T59236 etc. Recommendation is to use the default Java UTF8Reader instead of the Xerces one, which should be fine with current Java versions.

Event Timeline

brion created this task.Sep 26 2017, 11:27 PM
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptSep 26 2017, 11:27 PM

Change 381733 had a related patch set uploaded (by Brion VIBBER; owner: Brion VIBBER):
[mediawiki/tools/mwdumper@master] Workaround to use JVM's UTF-8 decoder in place of Xerces one

https://gerrit.wikimedia.org/r/381733

Change 381733 merged by jenkins-bot:
[mediawiki/tools/mwdumper@master] Workaround to use JVM's UTF-8 decoder in place of Xerces one

https://gerrit.wikimedia.org/r/381733

@brion: Patch has been merged in Gerrit. Can this task be resolved, or is there more to do in this task?

Aklapper closed this task as Resolved.Fri, May 22, 1:10 PM

No reply from @brion hence assuming this has been fixed