Page MenuHomePhabricator

Provide automated conversion to UTF-8 and binary schemas for MySQL 4.1/5.0+
Closed, DeclinedPublic

Description

The MySQL 4.0-backwards-compatible default schema causes a lot of problems on MySQL 4.1 and higher:

  • confusing results when doing direct queries
  • difficulty sharing data with other applications
  • data corruption with 'mysqldump'

The 'MySQL 5'-mode schemas avoid these problems, and are finally working properly with the standard updaters:

  • The binary schema preserves full compatibility with non-BMP Unicode characters (and thus data from Wikimedia sites)
  • The UTF-8 schema is limited by MySQL's BMP-only support, but still seems reasonably functional.

However, the old schema is still the default, and there's currently no provision for cross-upgrading from the old schema to one of the new ones. The installer/updater should be able to detect the old-style schema and convert. A converter maintenance script might also be helpful.

At the worst, going row-by-row and doing UTF-8 and double-UTF-8 checks and converting line-by-line should do the job. :)

Once conversion is in place, we can consider using the more capable binary schema as the default.


Version: unspecified
Severity: enhancement

Details

Reference
bz13675

Event Timeline

bzimport raised the priority of this task from to Low.Nov 21 2014, 10:01 PM
bzimport set Reference to bz13675.
bzimport added a subscriber: Unknown Object (MLST).

1.18 was the last version to support MySQL 4.
I guess this is a request that could realistically be a WONTFIX nowadays?

Paladox raised the priority of this task from Low to Needs Triage.
Paladox set Security to None.
Paladox added a subscriber: Aklapper.
Paladox subscribed.

Mediawiki doesn't support MySQL 4.0 any more.