Page MenuHomePhabricator

Search does not ignore soft hyphens (U+00AD, ­ )
Closed, ResolvedPublic

Description

Apparently WIkimedia wikis' search does not ignore soft hyphens (U+00AD, ­ ), making it impossible to find occurences of words if they were written with the hyphen in them. I've been told this is a problem for Wikisources, as OCR applications often generate these hyphens.

Event Timeline

matmarex raised the priority of this task from to Needs Triage.
matmarex updated the task description. (Show Details)
matmarex added a project: MediaWiki-Search.
matmarex added subscribers: matmarex, Ankry.
Restricted Application added a subscriber: Aklapper. · View Herald Transcript
TJones claimed this task.
TJones subscribed.

In the intervening time, this has been taken care of by the icu_normalize filter which is enabled everywhere.