⚓ T313733 Decommission mc20[19-27] and mc20[29-37]

	Subject	Repo	Branch	Lines +/-
	site: Remove retired mc* hosts	operations/puppet	production	+0 -5

akosiaris created this task.Jul 25 2022, 2:31 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJul 25 2022, 2:31 PM

jijiki moved this task from Incoming 🐫 to 🙈🙉🙊Backlog on the serviceops board.Sep 28 2022, 2:17 PM

jijiki moved this task from 🙈🙉🙊Backlog to 🛠 Upgrades and Hardware on the serviceops board.Sep 28 2022, 4:31 PM

akosiaris added a subtask: T293012: Productionise mc20[38-55].Dec 15 2022, 12:19 PM

jijiki renamed this task from Decommission mc2019-mc2036 to Decommission mc2019-mc2037.Dec 16 2022, 11:29 AM

jijiki claimed this task.

jijiki updated the task description. (Show Details)

jijiki moved this task from 🛠 Upgrades and Hardware to Doing 😎 on the serviceops board.Dec 16 2022, 11:30 AM

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2019.codfw.wmnet

mc2019.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2020.codfw.wmnet

mc2020.codfw.wmnet (FAIL)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Failed to wipe swraid, partition-table and filesystem signatures, manual intervention required to make it unbootable: Cumin execution failed (exit_code=2)
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

ERROR: some step on some host failed, check the bolded items above

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2021.codfw.wmnet

mc2021.codfw.wmnet (FAIL)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Failed to wipe swraid, partition-table and filesystem signatures, manual intervention required to make it unbootable: Cumin execution failed (exit_code=2)
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

ERROR: some step on some host failed, check the bolded items above

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2022.codfw.wmnet

mc2022.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2023.codfw.wmnet

mc2023.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2024.codfw.wmnet

mc2024.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2025.codfw.wmnet

mc2025.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2026.codfw.wmnet

mc2026.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2027.codfw.wmnet

mc2027.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2029.codfw.wmnet

mc2029.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2030.codfw.wmnet

mc2030.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2031.codfw.wmnet

mc2031.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2032.codfw.wmnet

mc2032.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2033.codfw.wmnet

mc2033.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2034.codfw.wmnet

mc2034.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2035.codfw.wmnet

mc2035.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2036.codfw.wmnet

mc2036.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

cookbooks.sre.hosts.decommission executed by jiji@cumin1001 for hosts: mc2037.codfw.wmnet

mc2037.codfw.wmnet (WARN)
- Downtimed host on Icinga/Alertmanager
- Found physical host
- Management interface not found on Icinga, unable to downtime it
- Wiped all swraid, partition-table and filesystem signatures
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

Change 878177 had a related patch set uploaded (by Effie Mouzeli; author: Effie Mouzeli):

[operations/puppet@production] site: Remove retired mc* hosts

https://gerrit.wikimedia.org/r/878177

gerritbot added a project: Patch-For-Review.Jan 10 2023, 7:10 PM

jijiki renamed this task from Decommission mc2019-mc2037 to Decommission mc20[19-27] and mc20[29-37].Jan 10 2023, 7:52 PM

jijiki updated Other Assignee, added: Jclark-ctr.

jijiki edited projects, added DC-Ops, ops-eqiad; removed serviceops.

jijiki updated the task description. (Show Details)

@Jclark-ctr please note that mc2020 and mc2021 are probably still bootable due to a failure during running the decomm script

jijiki reassigned this task from jijiki to Jclark-ctr.Jan 10 2023, 7:58 PM

jijiki updated Other Assignee, removed: Jclark-ctr.

jijiki subscribed.

Maintenance_bot added a project: SRE.Jan 10 2023, 8:45 PM

Change 878177 merged by Effie Mouzeli:

[operations/puppet@production] site: Remove retired mc* hosts

https://gerrit.wikimedia.org/r/878177

Maintenance_bot removed a project: Patch-For-Review.Jan 11 2023, 1:30 PM

jijiki removed Jclark-ctr as the assignee of this task.Jan 12 2023, 6:38 PM

jijiki edited projects, added ops-codfw; removed ops-eqiad.

Papaul assigned this task to Jhancock.wm.Jan 12 2023, 6:40 PM

Hello can someone please confirm that those servers are ready for decom since they are are all active in Netbox . Thanks

@Papaul, this is my bad, thank you for taking care of Netbox (or the gentle soul that did so).

@Papaul I've finished the onsite items. SSDs have been removed, servers have been unracked. Servers have been moved to the storage cage and will work on removing side rails and staging the SSDs.

@Jhancock.wm thank you

Papaul moved this task from Backlog to Decommission on the ops-codfw board.Jan 18 2023, 12:59 AM

complete

Status	Assigned	Task
Resolved	Jhancock.wm	T313733 Decommission mc20[19-27] and mc20[29-37]
Resolved	jijiki	T293012 Productionise mc20[38-55]
		Unknown Object (Task)
Resolved	Papaul	T294962 Q2:(Need By: TBD) rack/setup/install mc20[38-55]
Duplicate	None	T302218 setup/install mc20[38-55]

Decommission mc20[19-27] and mc20[29-37]
Closed, ResolvedPublic
Actions

Description

2019.codfw.net

2020.codfw.net

2021.codfw.net

2022.codfw.net

2023.codfw.net

2024.codfw.net

2025.codfw.net

2026.codfw.net

2027.codfw.net

2029.codfw.net

2030.codfw.net

2031.codfw.net

2032.codfw.net

2033.codfw.net

2034.codfw.net

2035.codfw.net

2036.codfw.net

2037.codfw.net

Details

Related Objects
Search...

Event Timeline

Decommission mc20[19-27] and mc20[29-37]Closed, ResolvedPublicActions