
[TUX] Special:SearchTranslations: improve tokenization (word boundaries)
Closed, Resolved (Public)

Description

Searching for "GENDER" finds nothing useful because the search returns only exact matches: exact not only in case but also from space to space. To find a string like "{{GENDER:$1|sò|sò|sò}}" you therefore have to use the whole string as the keyword, or to use the wildcard: *GENDER*
Precision is nice but at the very least things like brackets, pipes etc. should be considered word boundaries. When a decision is made, add info on how to use wildcards etc.
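
To illustrate the request, here is a minimal Python sketch contrasting space-to-space tokenization with tokenization that treats brackets, pipes and other punctuation as word boundaries. It is not the search backend's actual analyzer; the function names `whitespace_tokens` and `boundary_tokens` are hypothetical and exist only for this example.

```python
import re

def whitespace_tokens(text):
    """Split on whitespace only, so a token runs from space to space."""
    return text.split()

def boundary_tokens(text):
    """Treat every non-word character (brackets, pipes, colons, ...) as a boundary."""
    return re.findall(r"\w+", text)

message = "{{GENDER:$1|sò|sò|sò}}"

# With whitespace-only splitting the whole string is one token,
# so an exact-term search for "GENDER" cannot match it.
print("GENDER" in whitespace_tokens(message))

# With punctuation as word boundaries, "GENDER" becomes its own token.
print("GENDER" in boundary_tokens(message))
```

With the second tokenizer, a plain search for GENDER would hit the message without any wildcard.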


Version: master
Severity: enhancement
URL: https://translatewiki.net/w/i.php?title=Special%3ASearchTranslations&query=GENDER
See Also:
https://bugzilla.wikimedia.org/show_bug.cgi?id=46117

Details

Reference
bz46118

Event Timeline

bzimport raised the priority of this task to Normal. Nov 22 2014, 1:17 AM
bzimport set Reference to bz46118.
Nemo_bis created this task. Mar 14 2013, 2:34 PM

(In reply to comment #0)
> or to use the wildcard: *GENDER*
> Precision is nice but at the very least things like brackets, pipes etc.
> should be considered word boundaries. When a decision is made, add info on
> how to use wildcards etc.

Self-answer: * currently doesn't work (or rather, it doesn't match beyond word level, e.g. across spaces), while ? does.
https://lucene.apache.org/core/2_9_4/queryparsersyntax.html
Compare:
https://translatewiki.net/w/i.php?title=Special%3ASearchTranslations&query=Invia+apprezzamento
https://translatewiki.net/w/i.php?title=Special%3ASearchTranslations&query=Invi%3F
https://translatewiki.net/w/i.php?title=Special%3ASearchTranslations&query=Invia*apprezzamento
https://translatewiki.net/w/i.php?title=Special%3ASearchTranslations&query=Invia+*amento
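
The behaviour described above is consistent with wildcards being applied to one indexed term at a time. The following Python sketch models that idea under stated assumptions: it uses the standard library's `fnmatch` glob matching as a stand-in for the query parser's wildcards, the helper `term_matches` is hypothetical, and "Invia apprezzamento" is assumed to be the text of an indexed message. It is a simplified model, not how the search backend is implemented.

```python
from fnmatch import fnmatchcase

def term_matches(pattern, tokens):
    """Return True if the wildcard pattern matches any single token."""
    return any(fnmatchcase(token, pattern) for token in tokens)

# Assumed sample message, split into terms at spaces.
tokens = "Invia apprezzamento".split()

# "?" stands for one character inside a term, so it can match "Invia".
print(term_matches("Invi?", tokens))

# "*" also works inside a single term.
print(term_matches("Invia*", tokens))

# A pattern that would have to span the space matches no single term.
print(term_matches("Invia*apprezzamento", tokens))
```

In this model a wildcard can never bridge two words, which is why a query written as one pattern across a space finds nothing even though each word is indexed.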