Page MenuHomePhabricator

Unicode: detect Zawgyi encoding for Burmese strings
Open, LowPublic

Description

In Myanmar, many users have broken fonts that don’t correctly implement Unicode (‘Zawgyi’); see https://www.unicode.org/faq/myanmar.html Unicode Myanmar FAQ. Could Lingua Libre run a Zawgyi detector on entered Burmese text, in particular on the words to be spoken, and reject strings (or display a warning) if they’re likely mis-encoded? Google has an open-source Zawgyi detector in PHP, JavaScript, and other languages. This would prevent Burmese users from entering garbage characters that cannot be displayed by non-Burmese systems.

Event Timeline

Sorry, here’s the correct link to the Unicode FAQ about Zawgyi: https://www.unicode.org/faq/myanmar.html

Yug triaged this task as Low priority.Jul 6 2022, 10:24 PM
Yug moved this task from Backlog to Datasets and mass download on the Lingua-Libre-Legacy board.
Yug renamed this task from Detect Zawgyi encoding for Burmese strings to Uniode: detect Zawgyi encoding for Burmese strings.Jul 7 2022, 11:36 AM
Yug renamed this task from Uniode: detect Zawgyi encoding for Burmese strings to Unicode: detect Zawgyi encoding for Burmese strings.