Page MenuHomePhabricator

Upgrade backup hosts to Debian Bookworm 12.X
Closed, ResolvedPublic

Description

  • backupmon1001
  • backup hosts (progress-24/24 + 4/4 hosts to decom):
    • bacula director and storages (12/12)
    • hosts to decom (4/4)
    • media backup stores (12/12)
  • ms-backup (media backup workers)
  • dbprov hosts
  • db hosts:
    • backup sources
    • media backup databases

Related Objects

StatusSubtypeAssignedTask
Resolvedjcrespo
Resolvedjcrespo
Resolvedjcrespo
ResolvedRequestPapaul
ResolvedJhancock.wm
Resolvedjcrespo
Resolvedjcrespo
Resolved Marostegui
Resolved Marostegui
DeclinedABran-WMF
ResolvedABran-WMF
ResolvedABran-WMF
ResolvedLadsgroup
Resolved Marostegui
Resolved Marostegui
ResolvedRequestJclark-ctr
ResolvedRequestJclark-ctr
Resolved Marostegui
Resolved Marostegui
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedRequestJhancock.wm
ResolvedABran-WMF
Resolvedjcrespo
ResolvedRequestJclark-ctr
ResolvedRequestJhancock.wm
ResolvedRequestJclark-ctr
ResolvedRequestJhancock.wm

Event Timeline

jcrespo triaged this task as High priority.Oct 10 2024, 3:47 PM
jcrespo added a project: Epic.
jcrespo moved this task from Triage to In Progress on the Data-Persistence-Backup board.

Change #1116831 had a related patch set uploaded (by Jcrespo; author: Jcrespo):

[operations/puppet@production] dbbackups: Update grants for x1 dump sections too

https://gerrit.wikimedia.org/r/1116831

Change #1116831 merged by Jcrespo:

[operations/puppet@production] dbbackups: Update grants for x1 dump sections too

https://gerrit.wikimedia.org/r/1116831

Icinga downtime and Alertmanager silence (ID=493da83e-0408-4bfd-a460-3d2a2aaad3bf) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

ms-backup1002.eqiad.wmnet

Icinga downtime and Alertmanager silence (ID=ed206b65-dd8a-4f4d-a989-e6a86b3d3887) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

ms-backup1001.eqiad.wmnet

Icinga downtime and Alertmanager silence (ID=10f40619-3a56-4e29-a8da-746731dbd212) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

ms-backup2002.codfw.wmnet

Icinga downtime and Alertmanager silence (ID=df680574-af14-4955-9937-5d1914444d22) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

ms-backup2001.codfw.wmnet

Icinga downtime and Alertmanager silence (ID=845754b0-673b-41d7-8126-96470a2c0d2e) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

ms-backup2001.codfw.wmnet

Icinga downtime and Alertmanager silence (ID=e7cc4f24-75a7-400a-acc8-ff707cd469d0) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup2009.codfw.wmnet
jcrespo updated the task description. (Show Details)

backup2009 upgraded, 11/24 backup hosts have been upgraded; plus 4 will be decommed soon (T387892).

I just added to the list backupmon1001, a VM that also has to be upgraded.

Icinga downtime and Alertmanager silence (ID=010e024e-fe7e-4445-b38a-8c2cadcb50bd) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup2008.codfw.wmnet

Icinga downtime and Alertmanager silence (ID=d145ffe3-8ff7-43a5-8fce-e6e5cd963182) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup1008.eqiad.wmnet

backup1008 and backup2008 upgraded, too.

Icinga downtime and Alertmanager silence (ID=3109da04-78e8-461e-89f2-1f353f539daf) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup1009.eqiad.wmnet

With the upgrade of backup1009, all non-media backup hosts have been upgraded, with the exception of those about to be decommissioned and backup[12]003, which is in active use. That will be upgraded as soon as the es ro backups finish.

Icinga downtime and Alertmanager silence (ID=580ea459-b022-41a0-b2e3-f26f38703675) set by jynus@cumin1003 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup1003.eqiad.wmnet

backup1003 done, only backup2003 pending, which has to wait the ongoing es-rw backups from backup1013.

Icinga downtime and Alertmanager silence (ID=a969525e-9eb7-4582-8ae8-17b8cdd5058c) set by jynus@cumin1003 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup2003.codfw.wmnet

backup2003 done, only media backup hosts pending to upgrade.

Icinga downtime and Alertmanager silence (ID=08772b21-3872-44e8-8f06-e23fa995033c) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup2004.codfw.wmnet

Icinga downtime and Alertmanager silence (ID=0f261daa-cd8e-4444-a4ad-982760deb79e) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup2005.codfw.wmnet

Icinga downtime and Alertmanager silence (ID=3c23cb22-fc3e-4ca0-8dea-8708e9402a3f) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup2006.codfw.wmnet

Icinga downtime and Alertmanager silence (ID=80a33647-b307-4563-8fc8-c5bea57096ab) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup2007.codfw.wmnet

Icinga downtime and Alertmanager silence (ID=193b6e89-71e3-4f46-b0a6-d7a0247c1152) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup1004.eqiad.wmnet

Icinga downtime and Alertmanager silence (ID=6e1853f2-65a8-4259-a21a-b7d8a7f2ad94) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup1005.eqiad.wmnet
jcrespo updated the task description. (Show Details)

Icinga downtime and Alertmanager silence (ID=fe02c9a5-dd15-4b95-b6de-e1f5eeaa93e4) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup1006.eqiad.wmnet

Icinga downtime and Alertmanager silence (ID=f2d97d5a-efbb-4ab4-b7fe-d7e0bc6f78fa) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backup1007.eqiad.wmnet

Icinga downtime and Alertmanager silence (ID=060a436e-1017-4cc9-a834-e14a0867ca51) set by jynus@cumin1002 for 4:00:00 on 1 host(s) and their services with reason: Maintenance and reboot

backupmon1001.eqiad.wmnet
jcrespo claimed this task.

This is now done, all active backup-related hosts are on Debian 12 "Bookworm":

root@cumin1003:~$ cumin -o txt A:backup 'cat /etc/debian_version; uname -a'
34 hosts will be targeted:
backup[2003-2011,2013].codfw.wmnet,backup[1003-1011,1013-1014].eqiad.wmnet,backupmon1001.eqiad.wmnet,dbprov[2003-2006].codfw.wmnet,dbprov[1003-1006].eqiad.wmnet,ms-backup[2001-2002].codfw.wmnet,ms-backup[1001-1002].eqiad.wmnet
OK to proceed on 34 hosts? Enter the number of affected hosts to confirm or "q" to quit: 34
PASS |██████████████████████████████████████████████████████████████████████████████████████████████████████| 100% (34/34) [00:01<00:00, 28.78hosts/s]
FAIL |                                                                                                               |   0% (0/34) [00:01<?, ?hosts/s]
100.0% (34/34) success ratio (>= 100.0% threshold) for command: 'cat /etc/debian_version; uname -a'.
100.0% (34/34) success ratio (>= 100.0% threshold) of nodes successfully executed all commands.
_____FORMATTED_OUTPUT_____
backup1003.eqiad.wmnet: 12.11
backup1003.eqiad.wmnet: Linux backup1003 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1004.eqiad.wmnet: 12.11
backup1004.eqiad.wmnet: Linux backup1004 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1005.eqiad.wmnet: 12.11
backup1005.eqiad.wmnet: Linux backup1005 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1006.eqiad.wmnet: 12.11
backup1006.eqiad.wmnet: Linux backup1006 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1007.eqiad.wmnet: 12.11
backup1007.eqiad.wmnet: Linux backup1007 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1008.eqiad.wmnet: 12.11
backup1008.eqiad.wmnet: Linux backup1008 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1009.eqiad.wmnet: 12.11
backup1009.eqiad.wmnet: Linux backup1009 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1010.eqiad.wmnet: 12.11
backup1010.eqiad.wmnet: Linux backup1010 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1011.eqiad.wmnet: 12.11
backup1011.eqiad.wmnet: Linux backup1011 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1013.eqiad.wmnet: 12.11
backup1013.eqiad.wmnet: Linux backup1013 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup1014.eqiad.wmnet: 12.11
backup1014.eqiad.wmnet: Linux backup1014 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2003.codfw.wmnet: 12.11
backup2003.codfw.wmnet: Linux backup2003 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2004.codfw.wmnet: 12.11
backup2004.codfw.wmnet: Linux backup2004 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2005.codfw.wmnet: 12.11
backup2005.codfw.wmnet: Linux backup2005 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2006.codfw.wmnet: 12.11
backup2006.codfw.wmnet: Linux backup2006 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2007.codfw.wmnet: 12.11
backup2007.codfw.wmnet: Linux backup2007 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2008.codfw.wmnet: 12.11
backup2008.codfw.wmnet: Linux backup2008 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2009.codfw.wmnet: 12.11
backup2009.codfw.wmnet: Linux backup2009 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2010.codfw.wmnet: 12.11
backup2010.codfw.wmnet: Linux backup2010 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2011.codfw.wmnet: 12.11
backup2011.codfw.wmnet: Linux backup2011 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backup2013.codfw.wmnet: 12.11
backup2013.codfw.wmnet: Linux backup2013 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
backupmon1001.eqiad.wmnet: 12.11
backupmon1001.eqiad.wmnet: Linux backupmon1001 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
dbprov1003.eqiad.wmnet: 12.11
dbprov1003.eqiad.wmnet: Linux dbprov1003 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
dbprov1004.eqiad.wmnet: 12.11
dbprov1004.eqiad.wmnet: Linux dbprov1004 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
dbprov1005.eqiad.wmnet: 12.11
dbprov1005.eqiad.wmnet: Linux dbprov1005 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
dbprov1006.eqiad.wmnet: 12.11
dbprov1006.eqiad.wmnet: Linux dbprov1006 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
dbprov2003.codfw.wmnet: 12.11
dbprov2003.codfw.wmnet: Linux dbprov2003 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
dbprov2004.codfw.wmnet: 12.11
dbprov2004.codfw.wmnet: Linux dbprov2004 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
dbprov2005.codfw.wmnet: 12.11
dbprov2005.codfw.wmnet: Linux dbprov2005 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
dbprov2006.codfw.wmnet: 12.11
dbprov2006.codfw.wmnet: Linux dbprov2006 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
ms-backup1001.eqiad.wmnet: 12.11
ms-backup1001.eqiad.wmnet: Linux ms-backup1001 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
ms-backup1002.eqiad.wmnet: 12.11
ms-backup1002.eqiad.wmnet: Linux ms-backup1002 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
ms-backup2001.codfw.wmnet: 12.11
ms-backup2001.codfw.wmnet: Linux ms-backup2001 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
ms-backup2002.codfw.wmnet: 12.11
ms-backup2002.codfw.wmnet: Linux ms-backup2002 6.1.0-37-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.1.140-1 (2025-05-22) x86_64 GNU/Linux
root@cumin1003:~$ cumin -o txt A:db-backup-source 'cat /etc/debian_version'
14 hosts will be targeted:
db[2141,2197-2201,2239].codfw.wmnet,db[1150,1171,1216,1225,1239-1240,1245].eqiad.wmnet
OK to proceed on 14 hosts? Enter the number of affected hosts to confirm or "q" to quit: 14
PASS |██████████████████████████████████████████████████████████████████████████████████████████████████████| 100% (14/14) [00:01<00:00, 13.83hosts/s]
FAIL |                                                                                                               |   0% (0/14) [00:01<?, ?hosts/s]
100.0% (14/14) success ratio (>= 100.0% threshold) for command: 'cat /etc/debian_version'.
100.0% (14/14) success ratio (>= 100.0% threshold) of nodes successfully executed all commands.
_____FORMATTED_OUTPUT_____
db1150.eqiad.wmnet: 12.10
db1171.eqiad.wmnet: 12.10
db1216.eqiad.wmnet: 12.11
db1225.eqiad.wmnet: 12.11
db1239.eqiad.wmnet: 12.10
db1240.eqiad.wmnet: 12.10
db1245.eqiad.wmnet: 12.10
db2141.codfw.wmnet: 12.10
db2197.codfw.wmnet: 12.11
db2198.codfw.wmnet: 12.10
db2199.codfw.wmnet: 12.10
db2200.codfw.wmnet: 12.11
db2201.codfw.wmnet: 12.10
db2239.codfw.wmnet: 12.10