Page MenuHomePhabricator

Decommission db11[26-49]
Closed, ResolvedPublic

Description

Following T344036

  • db1126
  • db1127
  • db1128
  • db1129
  • db1130
  • db1131
  • db1132
  • db1133
  • db1134
  • db1135
  • db1136
  • db1137
  • db1138
  • db1139
  • db1140
  • db1141
  • db1142
  • db1143
  • db1144
  • db1145
  • db1146
  • db1147
  • db1148
  • db1149

Related Objects

StatusSubtypeAssignedTask
ResolvedArnoldokoth
ResolvedJhancock.wm
ResolvedJclark-ctr
ResolvedABran-WMF
ResolvedABran-WMF
ResolvedRequestJclark-ctr
ResolvedRequestJclark-ctr
ResolvedRequestJclark-ctr
ResolvedRequestJclark-ctr
Resolved Marostegui
ResolvedRequestVRiley-WMF

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Change 981440 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: decommission db1138

https://gerrit.wikimedia.org/r/981440

Icinga downtime and Alertmanager silence (ID=bffacd05-8841-49a1-9e9d-e042eb304501) set by arnaudb@cumin1001 for 1 day, 0:00:00 on 1 host(s) and their services with reason: decomission pre downtime

db1138.eqiad.wmnet

Change 981440 merged by Arnaudb:

[operations/puppet@production] mariadb: decommission db1138

https://gerrit.wikimedia.org/r/981440

Change 981441 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: remove db1141 db1142 db1143

https://gerrit.wikimedia.org/r/981441

Change 981441 merged by Arnaudb:

[operations/puppet@production] mariadb: remove db1141 db1142 db1143

https://gerrit.wikimedia.org/r/981441

Change 982198 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: decommission db1128 db1129 db1147

https://gerrit.wikimedia.org/r/982198

Change 982198 merged by Arnaudb:

[operations/puppet@production] mariadb: decommission db1128 db1129 db1147

https://gerrit.wikimedia.org/r/982198

Change 982875 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: decommission hosts

https://gerrit.wikimedia.org/r/982875

Change 982875 merged by Arnaudb:

[operations/puppet@production] mariadb: decommission hosts

https://gerrit.wikimedia.org/r/982875

ABran-WMF updated the task description. (Show Details)
ABran-WMF changed the task status from Open to In Progress.Dec 20 2023, 3:43 PM

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:29:06Z] <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1144.eqiad.wmnet with reason: provisionning db1234.eqiad.wmnet - T350458

Icinga downtime and Alertmanager silence (ID=cc740753-fd98-4755-ad37-dfd92af53af6) set by arnaudb@cumin1002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: provisionning db1234.eqiad.wmnet - T350458

db1144.eqiad.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:29:40Z] <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1144.eqiad.wmnet with reason: provisionning db1234.eqiad.wmnet - T350458

Icinga downtime and Alertmanager silence (ID=7b097df6-7429-4e29-9f7c-164dd2eb37b4) set by arnaudb@cumin1002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: provisionning db1234.eqiad.wmnet - T350458

db1244.eqiad.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:29:53Z] <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1244.eqiad.wmnet with reason: provisionning db1234.eqiad.wmnet - T350458

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:30:25Z] <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1244.eqiad.wmnet with reason: provisionning db1234.eqiad.wmnet - T350458

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:38:44Z] <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1144.eqiad.wmnet with reason: provisionning db1244.eqiad.wmnet - T350458

Icinga downtime and Alertmanager silence (ID=da094b6a-f059-477c-957d-fee829374a3f) set by arnaudb@cumin1002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: provisionning db1244.eqiad.wmnet - T350458

db1144.eqiad.wmnet

Icinga downtime and Alertmanager silence (ID=44ed7cb6-a06a-48a1-8efe-519851e204a7) set by arnaudb@cumin1002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: provisionning db1244.eqiad.wmnet - T350458

db1244.eqiad.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:39:00Z] <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1144.eqiad.wmnet with reason: provisionning db1244.eqiad.wmnet - T350458

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:39:14Z] <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1244.eqiad.wmnet with reason: provisionning db1244.eqiad.wmnet - T350458

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:39:28Z] <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1244.eqiad.wmnet with reason: provisionning db1244.eqiad.wmnet - T350458

Mentioned in SAL (#wikimedia-operations) [2024-02-01T13:41:08Z] <arnaudb@cumin1002> dbctl commit (dc=all): 'Cloning db1144 in db1244 for T350458', diff saved to https://phabricator.wikimedia.org/P56064 and previous config saved to /var/cache/conftool/dbconfig/20240201-134107-arnaudb.json

Mentioned in SAL (#wikimedia-operations) [2024-02-01T14:12:49Z] <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: provisionning db1246.eqiad.wmnet - T350458

Icinga downtime and Alertmanager silence (ID=fe9402f0-5956-4560-bb9a-16fb066dd913) set by arnaudb@cumin1002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: provisionning db1246.eqiad.wmnet - T350458

db1146.eqiad.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-02-01T14:13:06Z] <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: provisionning db1246.eqiad.wmnet - T350458

Mentioned in SAL (#wikimedia-operations) [2024-02-01T14:13:21Z] <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1246.eqiad.wmnet with reason: provisionning db1246.eqiad.wmnet - T350458

Icinga downtime and Alertmanager silence (ID=539c6327-d7ed-40e3-9363-fa3f56dcbac9) set by arnaudb@cumin1002 for 1 day, 0:00:00 on 1 host(s) and their services with reason: provisionning db1246.eqiad.wmnet - T350458

db1246.eqiad.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-02-01T14:13:33Z] <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1246.eqiad.wmnet with reason: provisionning db1246.eqiad.wmnet - T350458

Mentioned in SAL (#wikimedia-operations) [2024-02-01T14:15:32Z] <arnaudb@cumin1002> dbctl commit (dc=all): 'Cloning db1146 in db1246 for T350458', diff saved to https://phabricator.wikimedia.org/P56067 and previous config saved to /var/cache/conftool/dbconfig/20240201-141531-arnaudb.json

Technically no blocker for decom of db1139, db1140 or db1145, although I would wait 1 day to make sure backups run correctly on the new hosts at least once.

@jcrespo everything is going OK with all the new servers? if so, I'll close this task!

All backups looking good. Please proceed.

Change 1000300 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: decom db1133

https://gerrit.wikimedia.org/r/1000300

Change 1000301 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: removes db1135

https://gerrit.wikimedia.org/r/1000301

Change 1000302 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: removes db1139

https://gerrit.wikimedia.org/r/1000302

Change 1000303 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: removes db1140

https://gerrit.wikimedia.org/r/1000303

Change 1000304 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: removes db1144

https://gerrit.wikimedia.org/r/1000304

Change 1000305 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: removes db1145

https://gerrit.wikimedia.org/r/1000305

Change 1002406 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: removes db1146

https://gerrit.wikimedia.org/r/1002406

Change 1002407 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: removes db1149

https://gerrit.wikimedia.org/r/1002407

Change 1002406 merged by Arnaudb:

[operations/puppet@production] mariadb: removes db1146

https://gerrit.wikimedia.org/r/1002406

Change 1000303 merged by Arnaudb:

[operations/puppet@production] mariadb: removes db1140

https://gerrit.wikimedia.org/r/1000303

Change 1000305 merged by Arnaudb:

[operations/puppet@production] mariadb: removes db1145

https://gerrit.wikimedia.org/r/1000305

Change 1000304 abandoned by Arnaudb:

[operations/puppet@production] mariadb: removes db1144

Reason:

https://gerrit.wikimedia.org/r/1000304

Change 1000301 merged by Arnaudb:

[operations/puppet@production] mariadb: removes db1135

https://gerrit.wikimedia.org/r/1000301

Change 1002407 abandoned by Arnaudb:

[operations/puppet@production] mariadb: removes db1149

Reason:

https://gerrit.wikimedia.org/r/1002407

Change 1002411 had a related patch set uploaded (by Arnaudb; author: Arnaudb):

[operations/puppet@production] mariadb: removes db1144 db1149

https://gerrit.wikimedia.org/r/1002411

Change 1002411 merged by Arnaudb:

[operations/puppet@production] mariadb: removes db1144 db1149

https://gerrit.wikimedia.org/r/1002411

Change 1000302 merged by Arnaudb:

[operations/puppet@production] mariadb: removes db1139

https://gerrit.wikimedia.org/r/1000302

Change 1000300 merged by Arnaudb:

[operations/puppet@production] mariadb: decom db1133

https://gerrit.wikimedia.org/r/1000300

Mentioned in SAL (#wikimedia-operations) [2024-02-12T16:34:08Z] <arnaudb@cumin1002> dbctl commit (dc=all): 'Removing instances as per T350458', diff saved to https://phabricator.wikimedia.org/P56683 and previous config saved to /var/cache/conftool/dbconfig/20240212-163407-arnaudb.json

Mentioned in SAL (#wikimedia-operations) [2024-02-13T11:45:17Z] <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on db1133.eqiad.wmnet with reason: T350458

Mentioned in SAL (#wikimedia-operations) [2024-02-13T11:45:31Z] <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1133.eqiad.wmnet with reason: T350458

ABran-WMF updated the task description. (Show Details)