
Single letter tokens suffixed to article text in search
Open, Medium, Public

Description

On https://en.wikipedia.org/wiki/Waiting_at_the_Royal the indexed text field is suffixed with "v t e v t e". This creates single-letter tokens that end up being used by morelike. Nothing on the rendered webpage obviously explains these tokens, so something in the parser's HTML -> plaintext conversion is likely incorrect.

Event Timeline

This might be a useless comment, but "vte" reminded me of a common header in some embedded templates that provides quick links to "view", "talk" or "edit" the template as a stand-alone page. (I see there's no such template on the affected page, but maybe it still provides a hint. Or not.)
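
To illustrate the hypothesis in the comment above, here is a minimal sketch (not the actual CirrusSearch code) of how a naive HTML -> plaintext conversion could pick up "v t e" from view/talk/edit links, and how skipping the enclosing element would avoid it. The sample markup, the `navbar` class name, and the helper names `TextExtractor` and `to_plaintext` are assumptions made for illustration only.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not change nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TextExtractor(HTMLParser):
    """Collect text nodes, optionally skipping subtrees with given classes."""

    def __init__(self, skip_classes=()):
        super().__init__()
        self.skip_classes = set(skip_classes)
        self.skip_depth = 0  # > 0 while inside a skipped subtree
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        if self.skip_classes.intersection(classes):
            self.skip_depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        if not self.skip_depth:
            self.chunks.append(data)


def to_plaintext(html, skip_classes=()):
    """Return whitespace-normalized text, omitting skipped subtrees."""
    parser = TextExtractor(skip_classes)
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


# Hypothetical markup: an article body followed by view/talk/edit links.
SAMPLE = (
    '<p>Article body.</p>'
    '<div class="navbar"><ul>'
    '<li><a href="#"><abbr title="View this template">v</abbr></a></li>'
    '<li><a href="#"><abbr title="Discuss this template">t</abbr></a></li>'
    '<li><a href="#"><abbr title="Edit this template">e</abbr></a></li>'
    '</ul></div>'
)

naive = to_plaintext(SAMPLE)                             # keeps the link labels
filtered = to_plaintext(SAMPLE, skip_classes={"navbar"})  # drops the subtree
```

In this sketch `naive` ends with the single-letter tokens while `filtered` contains only the article body. Whether the real indexing pipeline sees markup like this on the affected page is unverified.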

Gehel triaged this task as Medium priority. Mon, May 6, 3:46 PM
Gehel moved this task from needs triage to elastic / cirrus on the Discovery-Search board.