Drain, reimage and re-add to the cluster:
- ganeti2019
- ganeti2020
- ganeti2021
- ganeti2022
- ganeti2023
- ganeti2024
- ganeti2025
- ganeti2026
- ganeti2027
- ganeti2028
- ganeti2029
- ganeti2030
- ganeti2031
- ganeti2032
Drain, reimage and re-add to the cluster:
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
Switch ganeti2027 to nftables for upcoming reimage | operations/puppet | production | +1 -0 |
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T348730 repeated Ganeti VMs deadlocks due to DRBD bug on bullseye | |||
Resolved | MoritzMuehlenhoff | T382508 Update remaining Ganeti servers in codfw to Bookworm |
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2024.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2024.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=05c11855-71d5-489c-8ed8-13baa1a2b7b9) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2019.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2019.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2019.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=00d48b0c-86e6-471d-a6ad-c116ef597e9d) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2021.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2021.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2021.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=93df70a9-c65f-4aaf-8a3d-5ab698636ed0) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2032.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2032.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2032.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=46a6b03e-0964-494b-92f3-40af6ca3beb9) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2022.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2022.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2022.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=4302b551-98b7-475e-9fb4-959f5c56a6cc) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2025.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2025.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2025.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=e9f62dcb-2ecf-4d32-84ca-34c181e86093) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2020.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2020.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2020.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=bc2c7bb0-3133-43fd-9040-c01d53f22d8f) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2026.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2026.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2026.codfw.wmnet with OS bookworm executed with errors:
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2026.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2026.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=160bb060-4ed1-4784-9312-c60a5421c725) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2028.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2028.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2028.codfw.wmnet with OS bookworm executed with errors:
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2028.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2028.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=7af53928-134c-4589-9808-e36a2bde4422) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2031.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2031.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2031.codfw.wmnet with OS bookworm executed with errors:
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2031.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2031.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=83262e5b-e9b2-4d97-bd96-7e9d851edd21) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2030.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2030.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2030.codfw.wmnet with OS bookworm executed with errors:
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2030.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2030.codfw.wmnet with OS bookworm completed:
Icinga downtime and Alertmanager silence (ID=c86b38d9-e3a1-4cba-abc9-083df51a2d3e) set by jmm@cumin2002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: remove from cluster for reimage
ganeti2029.codfw.wmnet
Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host ganeti2029.codfw.wmnet with OS bookworm
Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host ganeti2029.codfw.wmnet with OS bookworm completed:
All Ganeti nodes in codfw have been upgraded to Bookworm (and also migrated to nftables alongside).
Mentioned in SAL (#wikimedia-operations) [2025-01-31T08:52:48Z] <moritzm> rebalance codfw/A following OS updates T382508
Mentioned in SAL (#wikimedia-operations) [2025-01-31T12:16:01Z] <moritzm> rebalance codfw/D following OS updates T382508
Mentioned in SAL (#wikimedia-operations) [2025-02-06T08:24:39Z] <moritzm> rebalance codfw/B following OS updates T382508