Page MenuHomePhabricator

Adding new font for CJK media display
Open, LowPublic

Description

I am currently expanding on Periodic Table on Wikimedia Commons and encounted an issue that the newest characters as defined in Unicode 11.0 (e.g. U+9FEB 鿫, U+9FEC 鿬) are not displaying correctly on the SVG files.

I would like to suggest the rendering engine to be equipped with better CJK font (suggesting Source Han Sans which has a large coverage of CJK characters) so that those characters can be display correctly, and also any future SVG display with CJK characters.

Event Timeline

Note that Source Han Sans is licensed under the SIL Open Font License.

The linked ticket was back in jessie, but that's accurate.

From a random appserver on buster nowadays:

[mw2300:~] $ dpkg -l | grep cjk
ii  fonts-noto-cjk                       1:20170601+repack1-3+deb10u1                                        all          "No Tofu" font families with large Unicode coverage (CJK regular and bold)

This is https://packages.debian.org/buster/fonts-noto-cjk

so maybe it would have to be an upstream bug against that package

cc: @Muehlenhoff

The upstream repo is https://github.com/googlefonts/noto-cjk
We're currently running version 20170601, so this might be fixed in the 20201206-cjk release in Debian bullseye.

@NFSL2001 Do you have an example file with the Unicode characters you're missing? We can try rendering it on a system with the 20201206-cjk font and if that fixes it, consider a backport.

I have changed the SVG to use Noto Sans instead of Source Han Sans first and is waiting for the preview image to load.

https://commons.wikimedia.org/wiki/File:Periodic_table_zh-tw.svg
https://commons.wikimedia.org/wiki/File:Periodic_table_zh-hk.svg

@MoritzMuehlenhoff Is there any updates on substituting in noto-cjk? The files are still not loading the correct preview characters.

Sample of Unicode characters missing in svg: 鿬鿫

Files on commons that are using these characters:
https://commons.wikimedia.org/wiki/File:Periodic_table_zh-tw.svg
https://commons.wikimedia.org/wiki/File:Periodic_table_zh-hk.svg

Files on commons that will/should be using these characters:
https://commons.wikimedia.org/wiki/File:Periodic_table_zh-hans.svg (currently using traced diagram)

@MoritzMuehlenhoff Any updates for this? The preview character is still missisng and problamatic to Chinese users.

Here is a screenshot of how it should look for last 2 elements:

correct display.png (427×631 px, 74 KB)

Here is what is shown in preview:

missing.png (339×578 px, 31 KB)

Here are the character: 鿬鿫