Page MenuHomePhabricator

Accent mark should not be recognized as \b (word boundary)
Closed, DeclinedPublic

Description

In this edit
https://ru.wikipedia.org/w/index.php?title=%D0%AF%D1%82%D1%8C&diff=prev&oldid=76535525
AWB commit replacement "\bнее\b" -> "неё" in word "си<unicode accent mark>нее", it's wrong behavior.

Event Timeline

Rjwilmsi changed the task status from Open to Stalled.Feb 21 2016, 3:00 PM
Rjwilmsi subscribed.

@MaxBioHazard if you used \b in a Regex then the definition of what \b matches or not is part of the .NET Framework from MicroSoft, not something we configure for AWB, so you will need to change your regex if you don't want to match certain characters.