Page MenuHomePhabricator

CirrusSearch doesn't stem aliases to alias
Closed, ResolvedPublic

Description

CirrusSearch doesn't stem aliases to alias - just "aliase" which isn't right.


Version: unspecified
Severity: normal

Details

Reference
bz54811

Event Timeline

bzimport raised the priority of this task from to Low.Nov 22 2014, 2:29 AM
bzimport added a project: CirrusSearch.
bzimport set Reference to bz54811.
bzimport added a subscriber: Unknown Object (MLST).

Looks like I can fix this by switching the porter stemmer with the kstemmer. I'll do some more research.

So the problem is that the porter stemmer which we use right now stems "alias" to "alia" and "aliases" to "alias". They really need to stem to the same place.

I'm going to switch us to the kstemmer. It was designed as an improvement on the porter stemmer and in my somewhat limited testing it seems better.

Change 86854 had a related patch set uploaded by Manybubbles:
Tests for places where kstem beats porter stemmer.

https://gerrit.wikimedia.org/r/86854

Change 86854 merged by jenkins-bot:
Tests for places where kstem beats porter stemmer.

https://gerrit.wikimedia.org/r/86854

verified on test2wiki