Tune Elasticsearch recovery settings
Closed, Resolved · Public

Description

During our last cluster restart, the cluster failed to recover. See incident documentation for details.

To prevent this from happening again, we should probably reduce recover_after_time (the Elasticsearch gateway.recover_after_time setting) to 5 minutes.
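
For illustration, a rough sketch of what this could look like in elasticsearch.yml, assuming the setting meant here is gateway.recover_after_time. The node counts are placeholders, not our production values, and the real change would go through the puppet configuration rather than a hand-edited file.

```yaml
# elasticsearch.yml (sketch only; node counts are hypothetical placeholders)
gateway.expected_nodes: 10       # start recovery immediately once this many nodes have joined
gateway.recover_after_nodes: 8   # never start recovery with fewer nodes than this
gateway.recover_after_time: 5m   # if expected_nodes is not reached, wait at most 5 minutes before recovering anyway
```

With a short timeout, a cold restart in which some nodes never come back should still begin recovery after 5 minutes, provided the recover_after_nodes minimum is met.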

Event Timeline

Restricted Application added a subscriber: Aklapper.
debt triaged this task as High priority.
debt edited projects, added Discovery-Search (Current work); removed Discovery-Search.

Change 380524 had a related patch set uploaded (by Gehel; owner: Gehel):
[operations/puppet@production] elasticsearch: only wait 5 minutes for all nodes in case of cold restart

https://gerrit.wikimedia.org/r/380524

Change 380524 merged by Gehel:
[operations/puppet@production] elasticsearch: only wait 5 minutes for all nodes in case of cold restart

https://gerrit.wikimedia.org/r/380524