
Create cookbook to reindex into elasticsearch / cirrus
Closed, Declined (Public)

Description

We have a documented procedure to reindex after an outage, which is quite simple as it is. Still, it would be nice to create a cookbook for it, so that it is available centrally with the other operational procedures.

Event Timeline

fgiunchedi triaged this task as Medium priority. Apr 9 2019, 8:38 AM
Gehel raised the priority of this task from Medium to High. Sep 1 2020, 7:08 PM

Some potential nice-to-have features, from recent discussions about reindexing problems:

  • Catching reindexing failures in general and issuing an alert/warning. Right now the best/only/horrible way is something like grep -ilP "Reindex task was not successful|fail|error|warn" *log.
  • Check for multiple indexes: Alert, possibly before doing anything else, if there are multiple indexes for a given wiki. Alert if, after reindexing, there are multiple indexes.
  • Alert if the index didn't change from before reindexing to after (the index name is formatted as <wiki>_<timestamp>, so it should change).
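
The three checks above could be sketched roughly as follows. This is only an illustration, not an existing cookbook or script: the function names are hypothetical, the log text and index names are assumed to have been collected already (nothing here talks to Elasticsearch), and the index name is assumed to end in _<timestamp> with a numeric timestamp, as described above. Unlike grep -l, which lists matching files, the first helper returns the matching lines.

```python
import re
from collections import defaultdict

# Same alternatives as the grep in this ticket; IGNORECASE mirrors `grep -i`.
FAILURE_RE = re.compile(
    r"Reindex task was not successful|fail|error|warn", re.IGNORECASE
)

# Assumption: index names look like <wiki>_<timestamp>, with a numeric timestamp.
INDEX_RE = re.compile(r"^(?P<wiki>.+)_(?P<timestamp>\d+)$")


def failing_lines(log_text):
    """Return the log lines that match any of the failure patterns."""
    return [line for line in log_text.splitlines() if FAILURE_RE.search(line)]


def group_by_wiki(index_names):
    """Group index names by their <wiki> prefix; unparseable names are skipped."""
    groups = defaultdict(list)
    for name in index_names:
        match = INDEX_RE.match(name)
        if match:
            groups[match.group("wiki")].append(name)
    return dict(groups)


def duplicate_indexes(index_names):
    """Return {wiki: [indexes]} for every wiki that has more than one index."""
    return {
        wiki: sorted(names)
        for wiki, names in group_by_wiki(index_names).items()
        if len(names) > 1
    }


def unchanged_wikis(before, after):
    """Return wikis whose set of index names is identical before and after."""
    old, new = group_by_wiki(before), group_by_wiki(after)
    return sorted(
        wiki for wiki in old if wiki in new and set(old[wiki]) == set(new[wiki])
    )
```

A wrapper could call duplicate_indexes() before starting, then failing_lines() and unchanged_wikis() afterwards, and warn whenever any of them returns a non-empty result. Note that the broad fail|error|warn patterns will also match harmless lines, just as the grep does.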

I've moved this ticket back to "needs triage" so we can discuss it again in light of the recent problems with T274200, and decide if we should make it more of a priority, and maybe consider a different approach than cookbooks (so non-SREs can reindex).

If this ticket should only be for outage recovery, I can create a new ticket to cover the more general reindexing case.

Moving this out of the current work, but this is still a priority for us. Will revisit next quarter.

Gehel removed the point value for this task.

Per IRC conversation with @dcausse, we now have an alternate way of reindexing that does not involve cookbooks. As such, we can close out this ticket.

bking claimed this task.
Gehel changed the task status from Resolved to Declined. Fri, Nov 29, 9:40 AM
Gehel moved this task from Done to Reported on the Data-Platform-SRE (2024.11.09 - 2024.11.29) board.