
Search for Turkish İ - Unicode Character LATIN CAPITAL LETTER I WITH DOT ABOVE - U+0130 İ fails
Closed, ResolvedPublic

Description

Author: gangleri

Description:
Halló!

İ is
http://www.fileformat.info/info/unicode/char/0130/index.htm
Unicode Character LATIN CAPITAL LETTER I WITH DOT ABOVE - U+0130
HTML Entity (decimal) &#304; (hex) &#x130;
UTF-8 (hex) 0xC4 0xB0 (c4b0) %c4%b0 %C4%B0
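
The encodings listed above can be checked with a short Python sketch. It uses only the standard library; the variable names are illustrative and not part of the original report.

```python
from urllib.parse import quote, unquote

# U+0130 LATIN CAPITAL LETTER I WITH DOT ABOVE
ch = "\u0130"

# UTF-8 byte sequence of the character (two bytes: 0xC4 0xB0).
utf8_bytes = ch.encode("utf-8")
utf8_hex = utf8_bytes.hex()

# Percent-encoded form as it appears in the search URL.
percent_encoded = quote(ch)

# Decoding the URL fragment gives the original character back.
round_trip = unquote("%C4%B0")

# Numeric character references (decimal and hexadecimal).
decimal_ref = f"&#{ord(ch)};"
hex_ref = f"&#x{ord(ch):X};"

print(utf8_hex, percent_encoded, decimal_ref, hex_ref)
```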

Please search for İ :
http://de.wikipedia.org/wiki/Spezial:Search?ns1=1&search=%C4%B0&fulltext=Suche

This will not find
http://de.wikipedia.org/wiki/Diskussion:T%C3%BCrkische_Sprache#html_und_.C4.B1
Please note that İ appears as a literal Unicode character in
[[de:talk:Diskussion:Türkische_Sprache]] and is not encoded in &#nnnn; or
&#xnnnn; notation.
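
To illustrate why the distinction between a literal character and a numeric character reference matters for a plain substring search, here is a minimal Python sketch. The sample strings are made up for the example and are not taken from the wiki page; the sketch says nothing about how the MediaWiki search index actually stores text.

```python
import html

# Hypothetical sample texts: one with the literal character,
# one with the same character written as a numeric reference.
literal_text = "T\u00fcrkische \u0130"
escaped_text = "T\u00fcrkische &#304;"

needle = "\u0130"

# A plain substring search finds the literal character ...
found_literal = needle in literal_text

# ... but not the escaped form, unless the reference is decoded first.
found_escaped_raw = needle in escaped_text
found_escaped_decoded = needle in html.unescape(escaped_text)

print(found_literal, found_escaped_raw, found_escaped_decoded)
```

Because the page in question contains the literal character, entity decoding alone would not explain the failed search.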

Please search for ı :
http://de.wikipedia.org/wiki/Spezial:Search?ns1=1&search=%C4%B1&fulltext=Suche

This will find more results.

Best regards reinhardt [[user:gangleri]]

Please note that you can search for "İ" at MediaZilla, but you will get "false
positives". However, the Bugzilla CVS version will fail:

https://bugzilla.mozilla.org/show_bug.cgi?id=321427
[Bug Bugzilla 321427] Advanced search for Turkish İ - Unicode Character
LATIN CAPITAL LETTER I WITH DOT ABOVE - U+0130 İ fails


Version: unspecified
Severity: normal
URL: http://de.wikipedia.org/wiki/Spezial:Search?ns1=1&search=%C4%B0&fulltext=Suche

Details

Reference
bz4379

Event Timeline

bzimport raised the priority of this task to Medium. Nov 21 2014, 9:01 PM
bzimport added a project: MediaWiki-Search.
bzimport set Reference to bz4379.
bzimport added a subscriber: Unknown Object (MLST).

alefzet wrote:

This also affects Azerbaijani, Tatar, Crimean Tatar, Gagauz, and Kazakh. They all use a
Turkic alphabet with the uppercase/lowercase pairs İ/i and I/ı. It also relates to the lc, uc,
and ucfirst magic words.
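
The case-pair problem described in this comment can be sketched in Python. The default, locale-independent Unicode case mappings pair I with i, so İ and ı do not round-trip. The helpers `turkic_lower` and `turkic_upper` are hypothetical names written for this illustration; they are not MediaWiki functions and only show one possible way to apply the Turkic pairs.

```python
DOTTED_CAPITAL_I = "\u0130"   # İ
DOTLESS_SMALL_I = "\u0131"    # ı

# Default mappings: İ lowercases to "i" followed by U+0307 COMBINING DOT ABOVE,
# and ı uppercases to a plain "I".
default_lower = DOTTED_CAPITAL_I.lower()
default_upper = DOTLESS_SMALL_I.upper()


def turkic_lower(text: str) -> str:
    """Lowercase using the Turkic pairs İ -> i and I -> ı (illustrative helper)."""
    text = text.replace(DOTTED_CAPITAL_I, "i").replace("I", DOTLESS_SMALL_I)
    return text.lower()


def turkic_upper(text: str) -> str:
    """Uppercase using the Turkic pairs i -> İ and ı -> I (illustrative helper)."""
    text = text.replace("i", DOTTED_CAPITAL_I).replace(DOTLESS_SMALL_I, "I")
    return text.upper()


print(turkic_lower("\u0130STANBUL"), turkic_upper("i\u0131"))
```

A case-insensitive search that folds case with the default mappings would treat İ and i as different strings, which is consistent with the behaviour reported here, although the report does not confirm that this is the actual cause.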

*** This bug has been marked as a duplicate of bug 4430 ***