Decommission backup1001, backup1002, backup2001, backup2002 (and their arrays)
Open, In Progress, Medium, Public

Description

Rather than upgrading these hosts in place, we will decommission them by setting up replacement hosts and upgrading those instead (or by setting the replacements up already upgraded from the start):

  • backup1001 (to be replaced by backup1009)
  • backup1002 (to be replaced by backup1013)
  • backup2001 (to be replaced by backup2009)
  • backup2002 (to be replaced by backup2013)

Event Timeline

jcrespo changed the task status from Open to In Progress. Mar 4 2025, 4:47 PM
jcrespo triaged this task as Medium priority.
jcrespo updated the task description.

Change #1124487 had a related patch set uploaded (by Jcrespo; author: Jcrespo):

[operations/puppet@production] dbbackups: Prepare backup1013 to take over eqiad backups of es* dbs

https://gerrit.wikimedia.org/r/1124487

Change #1124487 merged by Jcrespo:

[operations/puppet@production] dbbackups: Prepare backup1013 to take over eqiad backups of es* dbs

https://gerrit.wikimedia.org/r/1124487

Change #1124720 had a related patch set uploaded (by Jcrespo; author: Jcrespo):

[operations/puppet@production] dbbackups: Prepare backup2013 to take over codfw backups of es* dbs

https://gerrit.wikimedia.org/r/1124720

Change #1124720 merged by Jcrespo:

[operations/puppet@production] dbbackups: Prepare backup2013 to take over codfw backups of es* dbs

https://gerrit.wikimedia.org/r/1124720

Mentioned in SAL (#wikimedia-operations) [2025-03-05T09:18:53Z] <jynus> deploy new backup grants for es1036,es1040 T387892

Mentioned in SAL (#wikimedia-operations) [2025-03-05T09:23:07Z] <jynus> deploy new backup grants for es2036,es2040 T387892

Change #1124738 had a related patch set uploaded (by Jcrespo; author: Jcrespo):

[operations/puppet@production] dbbackups: Migrate es backups from backup[12]02 to backup[12]13

https://gerrit.wikimedia.org/r/1124738

Change #1124738 merged by Jcrespo:

[operations/puppet@production] dbbackups: Migrate es backups from backup[12]02 to backup[12]13

https://gerrit.wikimedia.org/r/1124738

Mentioned in SAL (#wikimedia-operations) [2025-03-05T15:40:35Z] <jynus> starting es backups on new hosts backup1013, backup2013 T387892

I made a mistake: I didn't add the new hosts to the production grants of m1 (used for backup state tracking). This is not fatal (metadata gathering is optional, so missing it won't cause a hard failure), but it means we will have to fix that tomorrow and test again.
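A check along these lines could catch this kind of omission before the first backup run. It is only a minimal sketch: the function name `missing_grants`, the `grants_by_section` layout, and the sample data are hypothetical illustrations, not taken from the puppet repository or the actual grant files.

```python
# Hypothetical pre-flight check. The names and the data layout are
# illustrative assumptions, not the real dbbackups configuration.

def missing_grants(new_hosts, grants_by_section):
    """Return {section: [hosts lacking a grant]} for sections with gaps."""
    missing = {}
    for section, granted in grants_by_section.items():
        # Hosts we expect to be granted but that are absent for this section.
        absent = sorted(set(new_hosts) - set(granted))
        if absent:
            missing[section] = absent
    return missing


new_hosts = ["backup1013", "backup2013"]

# Sample state mirroring the situation described above: the es grants
# include the new hosts, the m1 grants still list only the old ones.
grants_by_section = {
    "es": ["backup1013", "backup2013"],
    "m1": ["backup1002", "backup2002"],
}

print(missing_grants(new_hosts, grants_by_section))
# prints {'m1': ['backup1013', 'backup2013']}
```

An empty result would mean every new host is present in every section's grants.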

Change #1124834 had a related patch set uploaded (by Jcrespo; author: Jcrespo):

[operations/puppet@production] dbbackups: Add additional m1 grants for backup[12]013 stats user

https://gerrit.wikimedia.org/r/1124834

Mentioned in SAL (#wikimedia-operations) [2025-03-06T09:28:35Z] <jynus> deploy additional grants to m1 T387892

Change #1124834 merged by Jcrespo:

[operations/puppet@production] dbbackups: Add additional m1 grants for backup[12]013 stats user

https://gerrit.wikimedia.org/r/1124834

Change #1125114 had a related patch set uploaded (by Jcrespo; author: Jcrespo):

[operations/puppet@production] dbbackups: Prepare backup1002, backup2002 for decommissioning

https://gerrit.wikimedia.org/r/1125114

Mentioned in SAL (#wikimedia-operations) [2025-03-12T10:14:28Z] <jynus> removing backup1002, backup2002 dump user on es6,es7 T387892

Mentioned in SAL (#wikimedia-operations) [2025-03-12T10:42:21Z] <jynus> removing backup1002, backup2002 dbbackups user @ m1 T387892

Change #1125114 merged by Jcrespo:

[operations/puppet@production] dbbackups: Prepare backup1002, backup2002 for decommissioning

https://gerrit.wikimedia.org/r/1125114