User Story: As a user of Estonian-language wikis, I want to have better Estonian language analysis so I see better search results (particularly, better recall).
Elasticsearch provides a Estonian language analyzer, but we don't currently use it for Estonian-language projects. We should enable it, have the performance verified by speakers, and then unpack it.
Acceptance Criteria:
- Estonian speakers verify reasonable performance of the stemmer
- Unpacked analyzer performs the same as the monolithic version (without general upgrades).
- Upgraded analyzer either has no unexpected impact (we know what to expect from ICU norm and homoglyph norm, for example), or the impact is reviewed by a speaker of the language.