Add Bengali currency/hexadecimal numerals to pagelist
Open, Needs Triage, Public

Description

Books on Bengali Wikisource use Bengali numerals as page numbers. In old books, front matter pages are numbered with Bengali currency numerals, much as Roman numerals are used elsewhere.

Example having both types: https://bn.wikisource.org/wiki/নির্ঘণ্ট:গৌড়রাজমালা.djvu

Pagelist creation therefore needs to support both of these numeral systems.

General Bengali numerals follow a decimal system ("0123456789" = "০১২৩৪৫৬৭৮৯") and thus adding support for "beng" numerals from the ICU, similarly to how the change from T268906 allows use of "thai" numerals, would suffice for this.
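
As a minimal sketch of the decimal case, the digit substitution can be illustrated in Python. This only mimics the digit mapping quoted above for plain integers; the function name `to_bengali_decimal` is hypothetical and is not part of ProofreadPage or ICU.

```python
# Digit-for-digit mapping taken from the task description.
BENGALI_DIGITS = str.maketrans("0123456789", "০১২৩৪৫৬৭৮৯")


def to_bengali_decimal(n: int) -> str:
    """Render a non-negative integer with Bengali decimal digits (hypothetical helper)."""
    if n < 0:
        raise ValueError("page numbers are expected to be non-negative")
    return str(n).translate(BENGALI_DIGITS)


print(to_bengali_decimal(168))  # ১৬৮
```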

Bengali currency numerals, on the other hand, use a base-16 (hexadecimal) system:

i=৴৹
ii=৵৹
iii=৶৹
iv=৷৹
v=৷৴৹
vi=৷৵৹
vii=৷৶৹
viii=৷৷৹
ix=৷৷৴৹
x=৷৷৵৹
xi=৷৷৶৹
xii=৸৹
xiii=৸৴৹
xiv=৸৵৹
xv=৸৶৹
xvi=১৲

(Note how xvi uses the regular Bengali numeral "১" with the currency zero placeholder "৲".) The values from xvii to xxxi follow the same pattern as above with a prefix "১", and xxxii uses a prefix "২" instead. All values higher than xv are thus effectively written by rendering the integer quotient of the number divided by 16 in regular Bengali numerals, rendering the number modulo 16 in currency numerals, and concatenating the two (so that clxviii, i.e. 168 = 10 × 16 + 8, becomes ১০৷৷৹). A custom ruleset along the lines of those in https://github.com/unicode-org/cldr/blob/master/common/rbnf/root.xml will likely be necessary for this.
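
The rule described above can be sketched in Python as follows. This is only an illustration of the quotient/remainder scheme, not the RBNF ruleset itself and not code from ProofreadPage; the names `to_bengali_currency`, `CURRENCY_1_TO_15` and `ZERO_PLACEHOLDER` are hypothetical. The forms for 1 to 15 are copied from the table above, and the use of "৲" for a zero remainder is assumed from the xvi example.

```python
# Regular Bengali digits, as listed in the task description.
BENGALI_DIGITS = str.maketrans("0123456789", "০১২৩৪৫৬৭৮৯")

# Currency forms for 1..15, copied from the table in the task description.
CURRENCY_1_TO_15 = [
    "৴৹", "৵৹", "৶৹", "৷৹",
    "৷৴৹", "৷৵৹", "৷৶৹", "৷৷৹",
    "৷৷৴৹", "৷৷৵৹", "৷৷৶৹", "৸৹",
    "৸৴৹", "৸৵৹", "৸৶৹",
]

# Assumption: a zero remainder is written with this placeholder, as in xvi = ১৲.
ZERO_PLACEHOLDER = "৲"


def to_bengali_currency(n: int) -> str:
    """Render a positive page number in Bengali currency numerals (sketch)."""
    if n < 1:
        raise ValueError("only positive page numbers are covered by the description")
    quotient, remainder = divmod(n, 16)
    # The quotient is written in regular Bengali digits and omitted below 16.
    prefix = str(quotient).translate(BENGALI_DIGITS) if quotient else ""
    # The remainder is written in currency numerals, or as the placeholder if zero.
    suffix = CURRENCY_1_TO_15[remainder - 1] if remainder else ZERO_PLACEHOLDER
    return prefix + suffix


print(to_bengali_currency(168))  # ১০৷৷৹
```

With this sketch, 16 gives ১৲ and 168 gives ১০৷৷৹, matching the examples in the description. Values such as 17 (rendered here as ১৴৹) are inferred from the stated pattern and have not been checked against printed books.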

Event Timeline

@Bodhisattwa: Is this language-specific, or is it about All-and-every-Wikisource? (See project description.)

@Bodhisattwa: Could you please answer the last comment? Thanks.

@Aklapper, this is mainly a request for Bengali Wikisource books, but it can also be used for bilingual books on other Wikisources.

Tpt claimed this task.
This comment was removed by Tpt.

Sorry, I confused this task with another one.

Change 655989 had a related patch set uploaded (by Tpt; owner: Tpt):
[mediawiki/extensions/ProofreadPage@master] Allows formatting in a lot of indian systems inside of the <pagelist> tags

https://gerrit.wikimedia.org/r/655989

Change 655989 merged by jenkins-bot:
[mediawiki/extensions/ProofreadPage@master] Allows formatting in a lot of indian systems inside of the <pagelist> tags

https://gerrit.wikimedia.org/r/655989

Done and deployed

I do not see the Bengali hexadecimal numeral system included in the widget.

@Inductiveload I believe @Bodhisattwa says that the "beng" option is not visible in the pagelist editing widget.

@Bodhisattwa To add the Bengali numeral option to the pagelist editing widget, you need to edit MediaWiki:Proofreadpage_pagelist_dropdown_values.json and add:

{
    "label": "Bengali numeral",
    "data": "beng"
},

where "Bengali numeral" is the label that will be displayed in the editing widget and should be translated to Bengali. If you want, I can do that if you give me the label you expect to be displayed for this option.

Sorry, I confused two different numeral systems: the ICU "beng" numerals are defined as the 10 digits "০১২৩৪৫৬৭৮৯" and so do not use base 16, unlike the currency numerals requested by this task. The base 10 numeral system is properly implemented and available now; the base 16 numerals are not yet available and are not provided by ICU.

I am a bit scared of adding a numeral system that is not in ICU. We did that with Roman numerals and we ended up with a naming clash with ICU ("roman" is upper case in ICU and lowercase in ProofreadPage...).

Mahir256 renamed this task from "Add Bengali numerals to pagelist" to "Add Bengali currency/hexadecimal numerals to pagelist". Nov 21 2021, 9:44 PM
Hrishikes updated the task description.