Page MenuHomePhabricator

Refresh restbase202[1-3] w/ restbase203[6-8]
Open, MediumPublic

Assigned To
None
Authored By
Eevans
Mon, Nov 18, 10:50 PM
Referenced Files
F57718961: image.png
Mon, Nov 18, 11:05 PM
F57718958: image.png
Mon, Nov 18, 11:05 PM
F57718949: image.png
Mon, Nov 18, 11:01 PM
Subscribers

Description

RESTBase cluster hosts restbase202[1-3] are EOL, replace them with new hosts restbase203[6-8].

  • Bootstrap
    • restbase2036 (row B)
    • restbase2037 (row C)
    • restbase2038 (row D)
  • Decommission
    • restbase2021 (row B)
      • a
      • b
      • c
    • restbase2022 (row C)
    • restbase2023 (row D)
  • Pool RESTBase

See also: T377896: Q2:rack/setup/install restbase203[6-8]

Event Timeline

restbase2036 additional IPs:

image.png (221×1 px, 51 KB)

restbase2037 & restbase2038 additional IPs:

image.png (221×1 px, 51 KB)

image.png (221×1 px, 51 KB)

Change #1092345 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/puppet@production] restbase: commission restbase203[6-8]

https://gerrit.wikimedia.org/r/1092345

Change #1092345 merged by Eevans:

[operations/puppet@production] restbase: commission restbase203[6-8]

https://gerrit.wikimedia.org/r/1092345

Mentioned in SAL (#wikimedia-operations) [2024-11-19T21:14:59Z] <eevans@cumin1002> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2036.codfw.wmnet with reason: Bootstrapping — T380236

Icinga downtime and Alertmanager silence (ID=4b99fd18-11df-4c74-95bb-c99c4757af17) set by eevans@cumin1002 for 30 days, 0:00:00 on 1 host(s) and their services with reason: Bootstrapping — T380236

restbase2036.codfw.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-11-19T21:15:14Z] <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2036.codfw.wmnet with reason: Bootstrapping — T380236

Mentioned in SAL (#wikimedia-operations) [2024-11-19T21:15:21Z] <eevans@cumin1002> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2037.codfw.wmnet with reason: Bootstrapping — T380236

Icinga downtime and Alertmanager silence (ID=f37decf6-9f83-4f16-82fa-48fe21f85399) set by eevans@cumin1002 for 30 days, 0:00:00 on 1 host(s) and their services with reason: Bootstrapping — T380236

restbase2037.codfw.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-11-19T21:15:35Z] <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2037.codfw.wmnet with reason: Bootstrapping — T380236

Mentioned in SAL (#wikimedia-operations) [2024-11-19T21:15:46Z] <eevans@cumin1002> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2038.codfw.wmnet with reason: Bootstrapping — T380236

Icinga downtime and Alertmanager silence (ID=9292e44e-bf6f-4071-93e5-55cc17a4bb52) set by eevans@cumin1002 for 30 days, 0:00:00 on 1 host(s) and their services with reason: Bootstrapping — T380236

restbase2038.codfw.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-11-19T21:16:00Z] <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2038.codfw.wmnet with reason: Bootstrapping — T380236

Mentioned in SAL (#wikimedia-operations) [2024-11-20T15:09:24Z] <urandom> bootstrapping cassandra, restbase2037-{a,b,c} — T380236

Eevans triaged this task as Medium priority.Wed, Nov 20, 3:11 PM
Eevans updated the task description. (Show Details)

Mentioned in SAL (#wikimedia-operations) [2024-11-20T19:12:15Z] <urandom> bootstrapping cassandra, restbase2038-{a,b,c} — T380236

Mentioned in SAL (#wikimedia-operations) [2024-11-21T00:42:12Z] <eevans@cumin1002> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2021.codfw.wmnet with reason: Decommissioning — T380236

Icinga downtime and Alertmanager silence (ID=aa62de12-5de6-4c95-aca3-db5cc67a1e73) set by eevans@cumin1002 for 30 days, 0:00:00 on 1 host(s) and their services with reason: Decommissioning — T380236

restbase2021.codfw.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-11-21T00:42:16Z] <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2021.codfw.wmnet with reason: Decommissioning — T380236

Mentioned in SAL (#wikimedia-operations) [2024-11-21T00:42:19Z] <eevans@cumin1002> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2022.codfw.wmnet with reason: Decommissioning — T380236

Icinga downtime and Alertmanager silence (ID=941362f7-c8d8-42d0-8eec-c2f1f00b7709) set by eevans@cumin1002 for 30 days, 0:00:00 on 1 host(s) and their services with reason: Decommissioning — T380236

restbase2022.codfw.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-11-21T00:42:31Z] <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2022.codfw.wmnet with reason: Decommissioning — T380236

Mentioned in SAL (#wikimedia-operations) [2024-11-21T00:42:36Z] <eevans@cumin1002> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2023.codfw.wmnet with reason: Decommissioning — T380236

Icinga downtime and Alertmanager silence (ID=38479039-f507-4251-8172-d1957f1540a8) set by eevans@cumin1002 for 30 days, 0:00:00 on 1 host(s) and their services with reason: Decommissioning — T380236

restbase2023.codfw.wmnet

Mentioned in SAL (#wikimedia-operations) [2024-11-21T00:42:46Z] <eevans@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2023.codfw.wmnet with reason: Decommissioning — T380236

Mentioned in SAL (#wikimedia-operations) [2024-11-21T00:45:56Z] <urandom> decommissioning Cassandra/restbase2021-{a,b,c} — T380236