Page MenuHomePhabricator

BanConverter (Balinese language converter) includes XML tags for punctuation
Open, Needs TriagePublic

Description

Certain punctuation characters in BanConverter are defined to translate to XMLish <g> tags:

# punctuation
᭚ → '<g type="panti"/>'; # PANTI
᭛ → '<g type="pamada"/>'; # PAMADA
᭜ → \@; # WINDU
᭝ → '<g type="pamungkah"/>'; # CARIK PAMUNGKAH
᭞ → \,; # CARIK SIKI
᭟ → \,\,; # CARIK PAREREN
᭠ → '<g type="pameneng"/>'; # PAMENENG
\u1B7D → '<g type="pantiLantang"/>';
\u1B7E → '<g type="pamadaLantang"/>';
EOF;

This doesn't actually work properly. In Parsoid, the tags are rendered as literal text, eg for:
https://ban.wikipedia.org/w/index.php?title=%E1%AC%93%E1%AC%A9%E1%AC%B8%E1%AC%A7%E1%AC%A2%E1%AC%BE%E1%AC%A6%E1%AD%84%E1%AC%A9%E1%AC%82%E1%AC%AE%E1%AC%B6&variant=ban-x-dharma&useparsoid=1&oldid=105938
we get:

image.png (497×420 px, 76 KB)

Whereas the legacy parser's implementation of language converter actually opened a new <g> tag in the HTML:
image.png (172×500 px, 60 KB)

I don't know what the proper transliteration of these punctuation marks are, but I'm pretty sure literal <g type="panti"/> is not the expected rendering.

This appears to have been present since the first revision of BanConverter in 9cb51879444cd5a72c60eec9b30a37c590e2330c for T263082: add <langconvert> parser tag.

Event Timeline

Hi, thanks for notice this @Bennylin

For this problem, my suggestion, the best proposed way to do this is to delete all "ban-x-dharma", "ban-x-palmleaf" and "ban-x-pku" (Puri Kauhan Ubud) for Balinese wikipedia as it is actually not used on the site, nor, any academic based on that transliteration. All the transliteration is mostly used in Balinese wikisource for special request back then by @kamholz and the team while migrating the Palm-leaf wiki which used this transliteration to Wikisource. I would like just have <ban> and <ban-bali> only for Balinese wikipedia and other sites on wikimedia project.

Thanks.

I think as a start we could make the extra variants available only via mediawiki configuration option, so that we can enable them on wikisource but not on the other projects (wikipedia, other sister projects).