Paste P8995

Khmer samples

Authored by TJones on Aug 28 2019, 2:32 PM.
Referenced Files
F30142892: screen.png
Aug 28 2019, 4:02 PM
F30142784: khmer.png
Aug 28 2019, 2:57 PM
F30142760: Windows 2008, Chrome 73.png
Aug 28 2019, 2:46 PM
F30142738: khmer.JPG
Aug 28 2019, 2:44 PM
F30142722: Screen Shot 2019-08-28 at 10.33.44 AM.png
Aug 28 2019, 2:35 PM
F30142701: raw.txt
Aug 28 2019, 2:33 PM
F30142690: raw.txt
Aug 28 2019, 2:32 PM
ញ៉ាំ ញុាំ ញំុា ញាុំ ញំាុ
ញ៉ាំ ញំ៉ា ញា៉ំ ញំា៉
ខ្មែរ ខែ្មរ ក​ស្ទួ កស្ទួ សួ្ទ
ម្លាំ មា្លំ ម្លំា មាំ្ល មំា្ល មំ្លា
ម្បី មី្ប បុ៉ ប៉ុ ច្ចុ ចុ្ច
ណ្ណោះ ណោ្ណះ ណ្ណះោ ណោះ្ណ
ង្ស៊ី ង្សី៊ ង៊្សី ង៊ី្ស ងី្ស៊ ងី៊្ស
ថំៅ ថៅំ ថាំ ថំា
ន្ត្រី ន្រ្តី ន្តី្រ នី្ត្រ ធ្វើ ធើ្វ ចា៎ះ ចាះ៎ ច៎ាះ
គ៌឵ ៜ្ខ

Event Timeline

TJones edited the content of this paste.

My screenshot (OSX 10.14 / Chrome 76)

Screen Shot 2019-08-28 at 10.33.44 AM.png (1×1 px, 135 KB)

My screenshot from Windows 10/Version 76.0.3809.100 (Official Build) (64-bit)

khmer.JPG (873×1 px, 81 KB)

From Browsershots—and thus a bit lo-res—Windows 2008, Chrome 73

Windows 2008, Chrome 73.png (712×752 px, 102 KB)

screen.png (395×386 px, 27 KB)

Mac OS 10.13.6 (High Sierra), Firefox 68.0.2

Thanks, gang! These are helpful!

BTW, Erik asked today how other search engines handle this kind of variation. Looks like some of the big U.S. ones don't.

The first two items on line 5 of the paste are ម្បី and មី្ប.

Note that the first form is more correct because the characters are (typed) in pronunciation order, though both look the same to everyone.
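
To make the ordering difference visible, here is a minimal sketch that prints the code points behind the two forms. It assumes "line 5" means the fifth line of the paste, so the two strings are the first two items there; the names `pronunciation_order`, `swapped_order`, `describe`, and `code_points` are made up for this illustration.

```python
import unicodedata

# Assumption: these are the first two items on the fifth line of the paste,
# written as escapes so the code point order is explicit.
pronunciation_order = "\u1798\u17D2\u1794\u17B8"  # MO, COENG, BA, II
swapped_order = "\u1798\u17B8\u17D2\u1794"        # MO, II, COENG, BA


def describe(text):
    """Return (code point, Unicode name) pairs for each character in text."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch)) for ch in text]


def code_points(text):
    """Return only the U+XXXX labels for each character in text."""
    return [cp for cp, _ in describe(text)]


if __name__ == "__main__":
    for label, text in (("pronunciation order", pronunciation_order),
                        ("swapped order", swapped_order)):
        print(label, text)
        for cp, name in describe(text):
            print("   ", cp, name)

    # Same set of characters, different sequence.
    print("same characters:", sorted(pronunciation_order) == sorted(swapped_order))
    print("equal as strings:", pronunciation_order == swapped_order)

    # Standard NFC normalization leaves both sequences as they are.
    nfc_a = unicodedata.normalize("NFC", pronunciation_order)
    nfc_b = unicodedata.normalize("NFC", swapped_order)
    print("equal after NFC:", nfc_a == nfc_b)
```

Both strings contain the same four characters; only the position of the vowel sign relative to the coeng + consonant pair differs, which is why a plain string comparison treats them as different words.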

Baidu gets it right, though the exact string affects ranking: ខ្មែរ vs ខែ្មរ—both get 639K results. In the second case, the second result is higher because it is an exact match.
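
One way a search index could treat the two spellings as the same word is to rewrite them into a single order before indexing and querying. The sketch below is only an illustration of that idea, not how Baidu or any other engine actually does it. The helper `canonicalize_khmer` is hypothetical, and it only handles one case: dependent vowels (U+17B6 to U+17C5) typed before a coeng + consonant pair. Other variations in the paste, such as swapped subscripts or reordered diacritics, are not covered.

```python
import re

# Hypothetical, deliberately narrow rule: one or more dependent vowels
# (U+17B6..U+17C5) followed by one or more coeng + consonant pairs
# (U+17D2 followed by U+1780..U+17A2).
_VOWELS_BEFORE_COENG = re.compile(
    "([\u17B6-\u17C5]+)((?:\u17D2[\u1780-\u17A2])+)"
)


def canonicalize_khmer(text):
    """Move coeng + consonant pairs in front of the vowels typed before them."""
    return _VOWELS_BEFORE_COENG.sub(r"\2\1", text)


# The two spellings from the comment above, as explicit code points.
khmer_expected = "\u1781\u17D2\u1798\u17C2\u179A"  # KHA, COENG, MO, AE, RO
khmer_swapped = "\u1781\u17C2\u17D2\u1798\u179A"   # KHA, AE, COENG, MO, RO

if __name__ == "__main__":
    print(canonicalize_khmer(khmer_swapped) == khmer_expected)
    print(canonicalize_khmer(khmer_expected) == khmer_expected)
```

Applying the same function to both the indexed text and the query would make the two spellings match, while the original string could still be kept for ranking exact matches higher.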