Results to missing articles should issue a delete job to the index
Closed, Declined · Public

Description

Mostly hypothetical, but in theory we could end up with articles that never get deleted from the index. I'm thinking mainly of the situation where you start over on a wiki but don't prune the Elasticsearch index. That is: you had a wiki named foowiki, stopped, and are starting foowiki again. The old articles will never be pruned since nobody can delete non-existing articles :)

When we get such a result we already don't display it (since we check for page existence application side after fetching our results), so it shouldn't be too terribly hard to insert a low-priority delete job around that point.
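
A minimal sketch of that idea, in Python for illustration only (it is not the actual search extension code). The names `DeleteJobQueue`, `filter_and_prune`, and `page_exists` are hypothetical: the existence check that already hides stale hits also pushes their document IDs onto a low-priority queue.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class DeleteJobQueue:
    """Hypothetical stand-in for a low-priority job queue."""
    pending: List[str] = field(default_factory=list)

    def push(self, doc_id: str) -> None:
        # Skip IDs that are already queued so repeated searches
        # do not pile up duplicate delete jobs.
        if doc_id not in self.pending:
            self.pending.append(doc_id)


def filter_and_prune(
    hits: Iterable[str],
    page_exists: Callable[[str], bool],
    queue: DeleteJobQueue,
) -> List[str]:
    """Return only hits whose page exists; queue the rest for deletion."""
    visible = []
    for doc_id in hits:
        if page_exists(doc_id):
            visible.append(doc_id)
        else:
            # Stale index entry: hide it from the results (as is already
            # done) and additionally ask for it to be removed from the index.
            queue.push(doc_id)
    return visible


# Usage: pages "1" and "3" exist, "2" is a leftover from an older wiki.
queue = DeleteJobQueue()
existing = {"1", "3"}
visible = filter_and_prune(["1", "2", "3", "2"], existing.__contains__, queue)
```

Because the deletion is queued rather than done inline, the search request itself stays as fast as before; the cleanup happens whenever the queue is drained.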

Event Timeline

Restricted Application added subscribers: StudiesWorld, Aklapper.
demon triaged this task as Lowest priority. Feb 24 2016, 3:50 PM
debt subscribed.

Our sanitizer will fix issues like this over time, so not a high priority.

MPhamWMF subscribed.

Closing out low/est priority tasks over 6 months old with no activity within last 6 months in order to clean out the backlog of tickets we will not be addressing in the near term. Please feel free to reopen if you think a ticket is important, but bear in mind that given current priorities and resourcing, it is unlikely for the Search team to pick up these tasks for the indefinite future. We hope that the requested changes have either been addressed by or made irrelevant by work the team has done or is doing -- e.g. upgrading Elasticsearch to a newer version will solve various ES-related problems -- or will be subsumed by future work in a more generalized way.

RhinosF1 removed a project: Discovery-Search.
RhinosF1 subscribed.

Re-opening tasks and removing from team workboard per IRC feedback given yesterday and discussion with MPham.

dcausse subscribed.

Superseded by the Saneitizer.