
Investigate enabling Nynorsk Light Stemmer
Open, Medium, Public

Description

While looking into T147959, I noticed that both Bokmål (nb) and Nynorsk (nn) are explicitly configured to use the Norwegian language analyzer. Elasticsearch has a Nynorsk light stemmer, which might do better for Nynorsk than the standard Norwegian analysis, which is listed as being for Bokmål.

This task would test the differences caused by switching nn.wikipedia.org to an analyzer that uses the Nynorsk light stemmer, probably including review by Nynorsk speakers (unless the results are obviously horrible). If it looks like an improvement, we would deploy and re-index.
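
For reference, here is a minimal sketch of what index settings using the Nynorsk light stemmer might look like. It assumes a custom analyzer modeled on the standard Norwegian analysis chain (standard tokenizer, lowercase, Norwegian stop words, stemmer) with the stemmer language swapped to `light_nynorsk`. The names `nynorsk_text`, `norwegian_stop`, and `nynorsk_stemmer` are hypothetical, and this is not the actual CirrusSearch configuration.

```python
import json


def build_nynorsk_analysis_settings() -> dict:
    """Return illustrative Elasticsearch analysis settings for Nynorsk.

    Assumption: all filter and analyzer names here are made up for
    this sketch; only the built-in types and values ("stop", "stemmer",
    "_norwegian_", "light_nynorsk", "standard", "lowercase") come from
    Elasticsearch itself.
    """
    return {
        "analysis": {
            "filter": {
                # Built-in Norwegian stop word list.
                "norwegian_stop": {
                    "type": "stop",
                    "stopwords": "_norwegian_",
                },
                # Stemmer token filter set to the Nynorsk light stemmer
                # instead of the default Norwegian (Bokmål) stemmer.
                "nynorsk_stemmer": {
                    "type": "stemmer",
                    "language": "light_nynorsk",
                },
            },
            "analyzer": {
                # Hypothetical analyzer name for this sketch.
                "nynorsk_text": {
                    "type": "custom",
                    "tokenizer": "standard",
                    "filter": [
                        "lowercase",
                        "norwegian_stop",
                        "nynorsk_stemmer",
                    ],
                }
            },
        }
    }


if __name__ == "__main__":
    # Print the settings as JSON, e.g. for use in an index-creation request.
    print(json.dumps(build_nynorsk_analysis_settings(), indent=2))
```

Comparing the output of an analyzer like this against the current Norwegian analyzer on a sample of nn.wikipedia.org text would show which tokens are stemmed differently, which is the kind of difference that speaker review would then evaluate.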