Page MenuHomePhabricator

toolforge k8s automation: introduce option to force reboot each node directly
Open, Needs TriagePublic

Description

As of today, the wmcs.toolforge.k8s.reboot cookbook will force-reboot only if the normal drain/reboot/uncordon procedure fails.

In some circumstances, for example when we know all nodes are stuck, we want to just do a rolling force-reboot of the fleet, without waiting for the timeouts.

Event Timeline

aborrero moved this task from Backlog to Automation on the User-aborrero board.
aborrero updated the task description. (Show Details)