From the mailing list:
From Lars Aronsson firstname.lastname@example.org via lists.wikimedia.org
This is a suggestion to change search, so it ignores postfix accents. Russian dictionaries (including Wiktionary) use accents to indicate stress on syllables, but these accents are never seen in plain text. In Russian Wiktionary, the verb бороться has the inflected form боритесь (imperative, plural), which does not have an entry of its own, but appears in a fact box (table) of inflected forms. However, since this is a dictionary, the word in the box is written with an accent: бори́тесь https://ru.wiktionary.org/wiki/бороться (I do realize that it would be possible to add redirect entries for all such inflected forms, but this has not been done in ru.wiktionary.) Searching for бори́тесь (which nobody would do) finds the relevant page, https://ru.wiktionary.org/w/index.php?search=бори́тесь but searching for боритесь (the normal thing) does not find the relevant page, https://ru.wiktionary.org/w/index.php?search=боритесь Note that Unicode doesn't contain accented versions of Cyrillic letters. Instead, the accent is made by suffixing a separate accent sign. $ echo "и" | od -c 0000000 320 270 \n $ echo "и́" | od -c 0000000 320 270 314 201 \n
Nik thinks this might be something we can get out of the unicode normalizer. We should have a look here.