Decommission wdqs200[4-6]
Closed, Resolved · Public

Description

Once T332314 is completed, older WDQS servers can be decommissioned.

Hosts that need to be decom'd (sourced from T326689):
wdqs200[4-6]

wdqs201[3-5] will replace wdqs200[4-6]. wdqs20[16-22] will be net-new hosts.
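The bracketed ranges above are shorthand for consecutive hostnames. As a rough illustration of how the notation expands, here is a minimal Python sketch; the helper name `expand_hosts` is hypothetical, it handles only a single numeric range per pattern, and it is not the parser used by the actual cookbook tooling.

```python
import re


def expand_hosts(pattern: str) -> list[str]:
    """Expand one bracketed numeric range, e.g. 'wdqs200[4-6]'.

    Hypothetical helper for illustration only: it supports a single
    '[start-end]' group and returns the pattern unchanged otherwise.
    """
    match = re.fullmatch(r"(.*)\[(\d+)-(\d+)\](.*)", pattern)
    if match is None:
        return [pattern]
    prefix, start, end, suffix = match.groups()
    # Preserve zero padding based on the width of the range start.
    width = len(start)
    return [
        f"{prefix}{n:0{width}d}{suffix}"
        for n in range(int(start), int(end) + 1)
    ]


if __name__ == "__main__":
    for name in expand_hosts("wdqs200[4-6]"):
        print(name)
```

Under these assumptions, `expand_hosts("wdqs200[4-6]")` yields the three hosts to be decommissioned, and `expand_hosts("wdqs20[16-22]")` yields the seven net-new hosts.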

AC:

  • decommission cookbook has been run
  • ticket is created for DC-Ops to reclaim the servers => T342600

Event Timeline

RKemper moved this task from Incoming to In Progress on the Data-Platform-SRE board.
RKemper subscribed.

With the new hosts in service, we can now begin decom'ing these hosts at our convenience.

RKemper renamed this task from Decommission old WDQS servers to Decommission wdqs200[4-6]. (Jul 24 2023, 11:30 PM)
RKemper updated the task description.

Change 941037 had a related patch set uploaded (by Ryan Kemper; author: Ryan Kemper):

[operations/puppet@production] decom wdqs200[4-6]

https://gerrit.wikimedia.org/r/941037

Change 941037 merged by Ryan Kemper:

[operations/puppet@production] decom wdqs200[4-6]

https://gerrit.wikimedia.org/r/941037

cookbooks.sre.hosts.decommission executed by ryankemper@cumin1001 for hosts: wdqs[2004-2006].codfw.wmnet

  • wdqs2004.codfw.wmnet (WARN)
    • Downtimed host on Icinga/Alertmanager
    • Found physical host
    • Management interface not found on Icinga, unable to downtime it
    • Wiped all swraid, partition-table and filesystem signatures
    • Powered off
    • [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
    • Configured the linked switch interface(s)
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB
  • wdqs2005.codfw.wmnet (WARN)
    • Downtimed host on Icinga/Alertmanager
    • Found physical host
    • Management interface not found on Icinga, unable to downtime it
    • Wiped all swraid, partition-table and filesystem signatures
    • Powered off
    • [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
    • Configured the linked switch interface(s)
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB
  • wdqs2006.codfw.wmnet (WARN)
    • Downtimed host on Icinga/Alertmanager
    • Found physical host
    • Management interface not found on Icinga, unable to downtime it
    • Wiped all swraid, partition-table and filesystem signatures
    • Powered off
    • [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
    • Configured the linked switch interface(s)
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB

The decom cookbook has finished, and the DC-Ops ticket to reclaim the servers has been created (T342600, also listed in the AC section of the task description).