Drain, reimage and re-add to cluster:
- ganeti4005
- ganeti4006
- ganeti4007
- ganeti4008
Drain, reimage and re-add to cluster:
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| Resolved | MoritzMuehlenhoff | T348730 repeated Ganeti VMs deadlocks due to DRBD bug on bullseye | |||
| Resolved | MoritzMuehlenhoff | T382511 Update Ganeti servers in ulsfo to Bookworm |
Icinga downtime and Alertmanager silence (ID=652b6b7b-5164-4a67-b73d-931451743ac2) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti4005.ulsfo.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti4005.ulsfo.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti4005.ulsfo.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=8b39a21c-2178-4c2d-85ec-b458f3c9ab46) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti4006.ulsfo.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti4006.ulsfo.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti4006.ulsfo.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=754c015a-5966-406b-8711-e527c555dafe) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti4007.ulsfo.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti4007.ulsfo.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti4007.ulsfo.wmnet with OS bookworm completed:
Mentioned in SAL (#wikimedia-operations) [2025-04-01T08:36:05Z] <moritzm> failover ganeti master in ulsfo to ganeti4005 T382511
Icinga downtime and Alertmanager silence (ID=0a440a7c-23d5-411a-82dc-b35d0662b15f) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti4008.ulsfo.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti4008.ulsfo.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti4008.ulsfo.wmnet with OS bookworm completed: