Page MenuHomePhabricator

Devanagari (Nepali) Text not properly rendered in PDF created by Collection extension
Closed, ResolvedPublic

Description

Thanks a lot for enabling Collection Extension in Nepali Wikipedia , it's really useful but we are facing issues since the pdf file generated is not much useful .

OS : Windows and Mac , Browsers : Firefox and Chrome

Steps to Reproduce the bug :

Actual result :

  • PDF extension is working on a straight forward manner ( byte stream ) without proper rendering or lay outing

Expected result :

  • Should support Devanagari rendering rather than just creating the pdf on the basis of byte stream please see the examples

Screen_Shot_2015-03-16_at_4.18.14_PM.png (84×1 px, 72 KB)

and
Screen_Shot_2015-03-16_at_4.18.08_PM.png (98×1 px, 69 KB)

Suggestion : Text Shaping engines like harfbuzz : http://www.freedesktop.org/wiki/Software/HarfBuzz/ pretty well for Devanagari including Nepali

Ref:

#http://unicode.org/charts/PDF/U0900.pdf
# https://www.microsoft.com/typography/otfntdev/devanot/appen.aspx
#http://www.unicode.org/L2/L1999/99235.pdf
#http://unicode.org/review/pr-37.pdf

Event Timeline

sarojdhakal assigned this task to demon.
sarojdhakal raised the priority of this task from to Medium.
sarojdhakal updated the task description. (Show Details)
sarojdhakal added subscribers: gerritbot, Glaisher, tomasz and 4 others.

@sarojdhakal: As you added the "Patch-For-Review" project here, please link to your patch in Gerrit that awaits review. Removing that project for the time being.

Aklapper renamed this task from Devanagari ( Nepali ) Text is not properly rendered in the pdf created by the Collection extension to Devanagari (Nepali) Text not properly rendered in PDF created by Collection extension.Mar 16 2015, 11:39 AM
Aklapper raised the priority of this task from Medium to Needs Triage.
Aklapper set Security to None.

@sarojdhakal: Did you talk to demon before, or why did you assign this task to him?

Just created the subtask under the task to enable collection extension, those are the default options taken, I did not modify it . I do not have patch to submit at the moment

demon subscribed.

Apparently:

There is no issue like this in Hindi Wikipedia :

But in All other devanagari based wikipedias :

  1. Newari Wikipedia :
  2. Nepali Wikipedia :
  3. Maithili Wikipedia :

Any customization to collection extension in Hindi wikipedia that could possibly fix this issue

@sarojdhakal: Did you talk to Dereckson before, or why did you assign this task to him?

Dereckson subscribed.

By lack of rationale for the assignment (and as I'm not currently working on this issue), I reset this task's assignee.

If I'm not mistaken, @cscott is the main maintainer of this extension.

If Hindi works, other Devanagari-based languages should work, too, so it may be just a simple configuration issue, but I'm really not an expert.

Yes, this is an issue with the mw-ocg-latexer package. We do have Devanagari support, so this is probably just a matter of adding the right configuration for newiki.

Change 212438 had a related patch set uploaded (by Cscott):
T92817: improve text rendering for nepali languages.

https://gerrit.wikimedia.org/r/212438

Change 212438 merged by jenkins-bot:
T92817: improve text rendering for nepali languages.

https://gerrit.wikimedia.org/r/212438

cscott claimed this task.

Biplab Anand says: "After the patch to OCG. The PDF export results looks pretty good. Happy to say the task is resolved after so long."