Page MenuHomePhabricator

CirrusSearch: If the search didn't find many results then rerun the suggester with more time consuming settings
Open, Needs TriagePublic

Description

After https://gerrit.wikimedia.org/r/#/c/119041/ we will be running the suggester with faster, less accurate settings. They'll be good enough to find lots of suggestions, but not all.

Proposal: if the search didn't find many results then rerun the suggester with more aggressive settings. Stuff like:
prefix_length: 1,
max_errors: 3,
confidence: 1.1,

Also, we might want to only do this if the suggest run with the query didn't find anything. Not sure. We could also check the score on the result and use some cutoff. Dunno.

Note: prefix_length and max_errors can cause much more CPU usage. I'm suggesting just a single notch more on each setting but that still would be quite a bit.


Version: unspecified
Severity: normal

Details

Reference
bz62726

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:55 AM
bzimport added a project: CirrusSearch.
bzimport set Reference to bz62726.
bzimport added a subscriber: Unknown Object (MLST).
demon removed a subscriber: demon.Aug 19 2015, 3:42 PM
Restricted Application added a project: Discovery. · View Herald TranscriptAug 19 2015, 3:42 PM
Restricted Application added a subscriber: Aklapper. · View Herald Transcript
Restricted Application added a subscriber: StudiesWorld. · View Herald TranscriptDec 10 2015, 11:03 PM