Page MenuHomePhabricator

Fix and add integration tests for language analyzers
Closed, ResolvedPublic

Description

Analysis components can be declared as prebuilt or configurable, prebuilt version are the most efficient as they are prebuilt and a single instance of the factory is present in mem but they are not usable as a custom tokenfilter.
This task is about unifying how we declare token filters possibly only using prebuilt token filters.

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript
dcausse triaged this task as Medium priority.

Change 489177 had a related patch set uploaded (by DCausse; owner: DCausse):
[search/extra-analysis@master] Add serbian_stemmer as a configurable token filter

https://gerrit.wikimedia.org/r/489177

Change 489179 had a related patch set uploaded (by DCausse; owner: DCausse):
[mediawiki/extensions/CirrusSearch@master] Use prebuilt serbian_stemmer

https://gerrit.wikimedia.org/r/489179

Change 489182 had a related patch set uploaded (by DCausse; owner: DCausse):
[search/extra-analysis@master] Add integration tests for esperanto_stemmer

https://gerrit.wikimedia.org/r/489182

Change 489201 had a related patch set uploaded (by DCausse; owner: DCausse):
[mediawiki/extensions/CirrusSearch@master] Use prebuilt slovak stemmer

https://gerrit.wikimedia.org/r/489201

Change 489202 had a related patch set uploaded (by DCausse; owner: DCausse):
[search/extra@master] Add prebuilt version of the slovak stemmer

https://gerrit.wikimedia.org/r/489202

Change 489215 had a related patch set uploaded (by DCausse; owner: DCausse):
[mediawiki/extensions/CirrusSearch@es6] Use prebuilt serbian_stemmer

https://gerrit.wikimedia.org/r/489215

Change 489216 had a related patch set uploaded (by DCausse; owner: DCausse):
[mediawiki/extensions/CirrusSearch@es6] Use prebuilt slovak stemmer

https://gerrit.wikimedia.org/r/489216

Change 489201 abandoned by DCausse:
Use prebuilt slovak stemmer

Reason:
wrong branch

https://gerrit.wikimedia.org/r/489201

Change 489179 abandoned by DCausse:
Use prebuilt serbian_stemmer

Reason:
wrong branch

https://gerrit.wikimedia.org/r/489179

Change 489202 merged by jenkins-bot:
[search/extra@master] Add prebuilt version of the slovak stemmer

https://gerrit.wikimedia.org/r/489202

Change 489177 merged by jenkins-bot:
[search/extra-analysis@master] Add serbian_stemmer as a configurable token filter

https://gerrit.wikimedia.org/r/489177

Change 489182 merged by jenkins-bot:
[search/extra-analysis@master] Add integration tests for esperanto_stemmer

https://gerrit.wikimedia.org/r/489182

Change 489215 merged by jenkins-bot:
[mediawiki/extensions/CirrusSearch@es6] Use prebuilt serbian_stemmer

https://gerrit.wikimedia.org/r/489215

Change 489216 merged by jenkins-bot:
[mediawiki/extensions/CirrusSearch@es6] Use prebuilt slovak stemmer

https://gerrit.wikimedia.org/r/489216