- backupmon1001
- backup hosts (progress-24/24 + 4/4 hosts to decom):
- bacula director and storages (12/12)
- hosts to decom (4/4)
- media backup stores (12/12)
- ms-backup (media backup workers)
- dbprov hosts
- db hosts:
- backup sources
- media backup databases
Description
Details
| Subject | Repo | Branch | Lines +/- | |
|---|---|---|---|---|
| dbbackups: Update grants for x1 dump sections too | operations/puppet | production | +32 -34 |
Event Timeline
Change #1116831 had a related patch set uploaded (by Jcrespo; author: Jcrespo):
[operations/puppet@production] dbbackups: Update grants for x1 dump sections too
Change #1116831 merged by Jcrespo:
[operations/puppet@production] dbbackups: Update grants for x1 dump sections too
Icinga downtime and Alertmanager silence (ID=493da83e-0408-4bfd-a460-3d2a2aaad3bf) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
ms-backup1002.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=ed206b65-dd8a-4f4d-a989-e6a86b3d3887) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
ms-backup1001.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=10f40619-3a56-4e29-a8da-746731dbd212) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
ms-backup2002.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=df680574-af14-4955-9937-5d1914444d22) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
ms-backup2001.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=845754b0-673b-41d7-8126-96470a2c0d2e) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
ms-backup2001.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=e7cc4f24-75a7-400a-acc8-ff707cd469d0) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup2009.codfw.wmnet
backup2009 upgraded, 11/24 backup hosts have been upgraded; plus 4 will be decommed soon (T387892).
I just added to the list backupmon1001, a VM that also has to be upgraded.
Icinga downtime and Alertmanager silence (ID=010e024e-fe7e-4445-b38a-8c2cadcb50bd) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup2008.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=d145ffe3-8ff7-43a5-8fce-e6e5cd963182) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup1008.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=3109da04-78e8-461e-89f2-1f353f539daf) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup1009.eqiad.wmnet
With the upgrade of backup1009, all non-media backup hosts have been upgraded, with the exception of those about to be decommissioned and backup[12]003, which is in active use. That will be upgraded as soon as the es ro backups finish.
Icinga downtime and Alertmanager silence (ID=580ea459-b022-41a0-b2e3-f26f38703675) set by jynus@cumin1003 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup1003.eqiad.wmnet
backup1003 done, only backup2003 pending, which has to wait the ongoing es-rw backups from backup1013.
Icinga downtime and Alertmanager silence (ID=a969525e-9eb7-4582-8ae8-17b8cdd5058c) set by jynus@cumin1003 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup2003.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=08772b21-3872-44e8-8f06-e23fa995033c) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup2004.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=0f261daa-cd8e-4444-a4ad-982760deb79e) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup2005.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=3c23cb22-fc3e-4ca0-8dea-8708e9402a3f) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup2006.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=80a33647-b307-4563-8fc8-c5bea57096ab) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup2007.codfw.wmnet
Icinga downtime and Alertmanager silence (ID=193b6e89-71e3-4f46-b0a6-d7a0247c1152) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup1004.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=6e1853f2-65a8-4259-a21a-b7d8a7f2ad94) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup1005.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=fe02c9a5-dd15-4b95-b6de-e1f5eeaa93e4) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup1006.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=f2d97d5a-efbb-4ab4-b7fe-d7e0bc6f78fa) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backup1007.eqiad.wmnet
Icinga downtime and Alertmanager silence (ID=060a436e-1017-4cc9-a834-e14a0867ca51) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot
backupmon1001.eqiad.wmnet
This is now done, all active backup-related hosts are on Debian 12 "Bookworm":
root@cumin1003:~$ cumin -o txt A:backup 'cat /etc/debian_version; uname -a' 34 hosts will be targeted: backup[2003-2011,2013].codfw.wmnet,backup[1003-1011,1013-1014].eqiad.wmnet,backupmon1001.eqiad.wmnet,dbprov[2003-2006].codfw.wmnet,dbprov[1003-1006].eqiad.wmnet,ms-backup[2001-2002].codfw.wmnet,ms-backup[1001-1002].eqiad.wmnet OK to proceed on 34 hosts? Enter the number of affected hosts to confirm or "q" to quit: 34 PASS |██████████████████████████████████████████████████████████████████████████████████████████████████████| 100% (34/34) [00:01<00:00, 28.78hosts/s] FAIL | | 0% (0/34) [00:01<?, ?hosts/s] 100.0% (34/34) success ratio (>= 100.0% threshold) for command: 'cat /etc/debian_version; uname -a'. 100.0% (34/34) success ratio (>= 100.0% threshold) of nodes successfully executed all commands. _____FORMATTED_OUTPUT_____ backup1003.eqiad.wmnet: 12.11 backup1003.eqiad.wmnet: Linux backup1003 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1004.eqiad.wmnet: 12.11 backup1004.eqiad.wmnet: Linux backup1004 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1005.eqiad.wmnet: 12.11 backup1005.eqiad.wmnet: Linux backup1005 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1006.eqiad.wmnet: 12.11 backup1006.eqiad.wmnet: Linux backup1006 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1007.eqiad.wmnet: 12.11 backup1007.eqiad.wmnet: Linux backup1007 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1008.eqiad.wmnet: 12.11 backup1008.eqiad.wmnet: Linux backup1008 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1009.eqiad.wmnet: 12.11 backup1009.eqiad.wmnet: Linux backup1009 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1010.eqiad.wmnet: 12.11 backup1010.eqiad.wmnet: Linux backup1010 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1011.eqiad.wmnet: 12.11 backup1011.eqiad.wmnet: Linux backup1011 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1013.eqiad.wmnet: 12.11 backup1013.eqiad.wmnet: Linux backup1013 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup1014.eqiad.wmnet: 12.11 backup1014.eqiad.wmnet: Linux backup1014 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2003.codfw.wmnet: 12.11 backup2003.codfw.wmnet: Linux backup2003 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2004.codfw.wmnet: 12.11 backup2004.codfw.wmnet: Linux backup2004 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2005.codfw.wmnet: 12.11 backup2005.codfw.wmnet: Linux backup2005 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2006.codfw.wmnet: 12.11 backup2006.codfw.wmnet: Linux backup2006 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2007.codfw.wmnet: 12.11 backup2007.codfw.wmnet: Linux backup2007 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2008.codfw.wmnet: 12.11 backup2008.codfw.wmnet: Linux backup2008 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2009.codfw.wmnet: 12.11 backup2009.codfw.wmnet: Linux backup2009 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2010.codfw.wmnet: 12.11 backup2010.codfw.wmnet: Linux backup2010 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2011.codfw.wmnet: 12.11 backup2011.codfw.wmnet: Linux backup2011 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backup2013.codfw.wmnet: 12.11 backup2013.codfw.wmnet: Linux backup2013 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux backupmon1001.eqiad.wmnet: 12.11 backupmon1001.eqiad.wmnet: Linux backupmon1001 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux dbprov1003.eqiad.wmnet: 12.11 dbprov1003.eqiad.wmnet: Linux dbprov1003 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux dbprov1004.eqiad.wmnet: 12.11 dbprov1004.eqiad.wmnet: Linux dbprov1004 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux dbprov1005.eqiad.wmnet: 12.11 dbprov1005.eqiad.wmnet: Linux dbprov1005 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux dbprov1006.eqiad.wmnet: 12.11 dbprov1006.eqiad.wmnet: Linux dbprov1006 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux dbprov2003.codfw.wmnet: 12.11 dbprov2003.codfw.wmnet: Linux dbprov2003 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux dbprov2004.codfw.wmnet: 12.11 dbprov2004.codfw.wmnet: Linux dbprov2004 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux dbprov2005.codfw.wmnet: 12.11 dbprov2005.codfw.wmnet: Linux dbprov2005 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux dbprov2006.codfw.wmnet: 12.11 dbprov2006.codfw.wmnet: Linux dbprov2006 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux ms-backup1001.eqiad.wmnet: 12.11 ms-backup1001.eqiad.wmnet: Linux ms-backup1001 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux ms-backup1002.eqiad.wmnet: 12.11 ms-backup1002.eqiad.wmnet: Linux ms-backup1002 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux ms-backup2001.codfw.wmnet: 12.11 ms-backup2001.codfw.wmnet: Linux ms-backup2001 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux ms-backup2002.codfw.wmnet: 12.11 ms-backup2002.codfw.wmnet: Linux ms-backup2002 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
root@cumin1003:~$ cumin -o txt A:db-backup-source 'cat /etc/debian_version' 14 hosts will be targeted: db[2141,2197-2201,2239].codfw.wmnet,db[1150,1171,1216,1225,1239-1240,1245].eqiad.wmnet OK to proceed on 14 hosts? Enter the number of affected hosts to confirm or "q" to quit: 14 PASS |██████████████████████████████████████████████████████████████████████████████████████████████████████| 100% (14/14) [00:01<00:00, 13.83hosts/s] FAIL | | 0% (0/14) [00:01<?, ?hosts/s] 100.0% (14/14) success ratio (>= 100.0% threshold) for command: 'cat /etc/debian_version'. 100.0% (14/14) success ratio (>= 100.0% threshold) of nodes successfully executed all commands. _____FORMATTED_OUTPUT_____ db1150.eqiad.wmnet: 12.10 db1171.eqiad.wmnet: 12.10 db1216.eqiad.wmnet: 12.11 db1225.eqiad.wmnet: 12.11 db1239.eqiad.wmnet: 12.10 db1240.eqiad.wmnet: 12.10 db1245.eqiad.wmnet: 12.10 db2141.codfw.wmnet: 12.10 db2197.codfw.wmnet: 12.11 db2198.codfw.wmnet: 12.10 db2199.codfw.wmnet: 12.10 db2200.codfw.wmnet: 12.11 db2201.codfw.wmnet: 12.10 db2239.codfw.wmnet: 12.10