Page MenuHomePhabricator

Add a "reverse" suggestion field to workaround the prefix length limitation (typos suggestion)
Closed, ResolvedPublic

Description

Today we use a suggestion field that is built against the title and redirect data. This field is used by the phrase suggester with a prefix length of 2. In other words if the typo appears in the first two characters all suggestions generated by the phrase suggester would be bad. Reducing the prefix length can hurt performances, another technique is to build a second suggestion field with a "reverse" filter.

Event Timeline

dcausse raised the priority of this task from to Needs Triage.
dcausse updated the task description. (Show Details)
dcausse subscribed.
Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Change 227478 had a related patch set uploaded (by DCausse):
WIP: Add reverse suggestion field

https://gerrit.wikimedia.org/r/227478

dcausse changed the task status from Open to Stalled.Jul 30 2015, 2:33 PM

We need to test if it's worth a try.

There's an issue in PhraseSuggester: https://github.com/elastic/elasticsearch/pull/12670

I will update the patch in cirrus to workaround the bug.

Looks like there's no workaround for this bug :(
moving to backlog until we have elasticsearch 1.7.2

Deskana subscribed.

We will have elastic 1.7.5 soon in production, I think it'd be nice to finish what we started on this task.
The top query for the last week of January 2016 was Terry Wigan which is certainly a typo to Terry Wogan (a radio and TV broadcaster who died this week).
I think that the reverse field would have helped in this case.

debt changed the task status from Stalled to Open.Aug 4 2016, 5:04 PM
debt removed a project: Discovery-Search.

Change 227478 merged by jenkins-bot:
Increase phrase suggester recall

https://gerrit.wikimedia.org/r/227478