Apparently, search on Wikimedia wikis does not ignore soft hyphens (U+00AD), making it impossible to find occurrences of words that were written with the hyphen in them. I've been told this is a problem for Wikisources, as OCR applications often generate these hyphens.
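To illustrate the report, here is a minimal sketch in Python of why a soft hyphen breaks plain matching and how removing it restores the match. It is not the search engine's actual code: the sample string and the `strip_soft_hyphens` helper are hypothetical, and a real search backend would do this in its analysis chain rather than with a string replace.

```python
# Hypothetical illustration only; not taken from the search backend.
SOFT_HYPHEN = "\u00ad"  # U+00AD, invisible in most renderings


def strip_soft_hyphens(text: str) -> str:
    """Return text with every soft hyphen (U+00AD) removed."""
    return text.replace(SOFT_HYPHEN, "")


# Sample OCR-style text (an assumption): soft hyphens sit inside two words.
ocr_text = "The manu\u00adscript was digi\u00adtized."

# A plain substring match fails because the stored word contains U+00AD.
found_raw = "manuscript" in ocr_text

# After removing soft hyphens, the same query matches.
found_clean = "manuscript" in strip_soft_hyphens(ocr_text)

print(found_raw, found_clean)
```

Applying the same removal to both the indexed text and the query keeps the two sides consistent.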
Description
Event Timeline
Restricted Application added a project: Discovery-ARCHIVED. · 2015-06-14 10:06:01 (UTC+0)
TheDJ moved this task from Backlog to CirrusSearch on the MediaWiki-Search board. · 2025-09-06 21:05:17 (UTC+0)
Restricted Application added a project: Discovery-Search. · 2025-09-06 21:05:18 (UTC+0)
Gehel edited projects, added Discovery-Search (2025.09.05 - 2025.09.26); removed Discovery-Search. · 2025-09-08 15:36:53 (UTC+0)
TJones claimed this task.
TJones subscribed.
In the intervening time, this has been taken care of by the icu_normalize filter, which is enabled everywhere.
Gehel moved this task from Incoming to Done on the Discovery-Search (2025.09.05 - 2025.09.26) board. · 2025-09-08 15:37:11 (UTC+0)
pfischer moved this task from Done to Reported on the Discovery-Search (2025.09.05 - 2025.09.26) board. · 2025-09-12 09:31:08 (UTC+0)