Page MenuHomePhabricator

Definitions of words get indexed in search
Closed, ResolvedPublic

Description

Some of our customers have run into issues where content that shows up in the search results for a page doesn't actually exist in the page. We root caused this issue to Lingo definitions being indexed as they are part of the Parser output. We would like to either 1) remove all Lingo definitions from the search index, or 2) Add a config option to turn this removal on.

I can open a patch with either solution if it's something the community would be interested in.

Thanks!

Details

Related Gerrit Patches:
mediawiki/extensions/Lingo : masterRemoves Lingo definitions from search results

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptOct 18 2019, 4:16 PM

Nice catch!
I don't think a config option is necessary, but if you could provide a patch implementing the first proposal, that would be really appreciated.

Change 544229 had a related patch set uploaded (by Juan Osorio (Microsoft); owner: Juan Osorio (Microsoft)):
[mediawiki/extensions/Lingo@master] Removes Lingo definitions from search results

https://gerrit.wikimedia.org/r/544229

Change 544229 merged by Foxtrott:
[mediawiki/extensions/Lingo@master] Removes Lingo definitions from search results

https://gerrit.wikimedia.org/r/544229

Reedy renamed this task from [Lingo] Definitions of words get indexed in search to Definitions of words get indexed in search.Oct 18 2019, 11:48 PM
Foxtrott closed this task as Resolved.Oct 21 2019, 6:41 AM