T233080 Decommission analytics1032

Subject	Repo	Branch	Lines +/-
Removing mgmt for the asset tag associated with analytics1032	operations/dns	master	+1 -2
Remove analytics1032's prod DNS records	operations/dns	master	+1 -4
Remove analytics1032 from puppet	operations/puppet	production	+0 -11
Prepare analytics1032 for decommission	operations/puppet	production	+7 -32

elukey created this task. Sep 17 2019, 8:04 AM

Change 537321 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] Prepare analytics1032 for decommission

https://gerrit.wikimedia.org/r/537321

gerritbot added a project: Patch-For-Review. Sep 17 2019, 8:08 AM

Mentioned in SAL (#wikimedia-analytics) [2019-09-17T08:19:40Z] <elukey> manually decommed analytics1032 for hdfs/yarn on the Hadoop testing cluster - T233080

Change 537321 merged by Elukey:
[operations/puppet@production] Prepare analytics1032 for decommission

https://gerrit.wikimedia.org/r/537321

elukey assigned this task to RobH. Sep 17 2019, 9:56 AM

elukey triaged this task as Medium priority.

elukey updated the task description.

Maintenance_bot removed a project: Patch-For-Review. Sep 17 2019, 10:10 AM

cookbooks.sre.hosts.decommission executed by elukey@cumin1001 for hosts: analytics1032.eqiad.wmnet

analytics1032.eqiad.wmnet (FAIL)
- Downtimed host on Icinga
- Downtimed management interface on Icinga
- Unable to connect to the host, wipe of bootloaders will not be performed: Cumin execution failed (exit_code=2)
- Failed to power off, manual intervention required: Remote IPMI for analytics1032.mgmt.eqiad.wmnet failed (exit=1): b''
- Set Netbox status to Decommissioning
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

ERROR: some step on some host failed, check the bolded items above

The host is stuck while booting, so the above script failed. I powered it off manually; the cleanup in Puppet, DebMonitor, etc. should have been done anyway.
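The bolding that marked the failed steps is lost in this plain-text copy. A minimal sketch for picking them out of the cookbook output above follows. The helper name and the failure prefixes are assumptions inferred from this single run, not something taken from the cookbook's source.

```python
# Hypothetical helper: flag failed steps in the decommission cookbook output.
# FAILURE_PREFIXES is guessed from the two failing lines in this run only.
FAILURE_PREFIXES = ("Unable to", "Failed to")

COOKBOOK_OUTPUT = """\
- Downtimed host on Icinga
- Downtimed management interface on Icinga
- Unable to connect to the host, wipe of bootloaders will not be performed: Cumin execution failed (exit_code=2)
- Failed to power off, manual intervention required: Remote IPMI for analytics1032.mgmt.eqiad.wmnet failed (exit=1): b''
- Set Netbox status to Decommissioning
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB
"""


def failed_steps(output):
    """Return the step lines whose text starts with a known failure prefix."""
    steps = [line[2:].strip() for line in output.splitlines() if line.startswith("- ")]
    return [step for step in steps if step.startswith(FAILURE_PREFIXES)]


for step in failed_steps(COOKBOOK_OUTPUT):
    # Print only the short summary before the first comma.
    print("FAILED:", step.split(",")[0])
```

Both flagged steps need the host or its management interface to respond, which fits a machine stuck at boot; the Netbox, DebMonitor and Puppet steps do not touch the host itself.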

elukey updated the task description. Oct 1 2019, 6:41 AM

Change 540017 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/puppet@production] Remove analytics1032 from puppet

https://gerrit.wikimedia.org/r/540017

Change 540017 merged by Elukey:
[operations/puppet@production] Remove analytics1032 from puppet

https://gerrit.wikimedia.org/r/540017

elukey@asw2-c-eqiad> show interfaces descriptions | match analytics1032
ge-3/0/12       up    down analytics1032 - no-bw-mon
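For readers unfamiliar with this output, a small sketch of how the line above can be read. It assumes the usual column order of `show interfaces descriptions` (Interface, Admin, Link, Description), since the header row is not part of the paste; the function name is hypothetical.

```python
# Assumed column order: Interface, Admin, Link, Description (header not in the paste).
LINE = "ge-3/0/12       up    down analytics1032 - no-bw-mon"


def parse_description_line(line):
    """Split one output row into its four fields; the description may contain spaces."""
    interface, admin, link, description = line.split(None, 3)
    return {
        "interface": interface,
        "admin": admin,
        "link": link,
        "description": description,
    }


port = parse_description_line(LINE)
if port["admin"] == "up" and port["link"] == "down":
    # Administratively enabled, but nothing is linked on the other end.
    print(f"{port['interface']} is enabled but has no link: {port['description']}")
```

Admin up with link down is what one would expect for a port whose host has been powered off but whose switch configuration has not yet been cleaned up.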

elukey updated the task description. Oct 1 2019, 6:49 AM

Change 540019 had a related patch set uploaded (by Elukey; owner: Elukey):
[operations/dns@master] Remove analytics1032's prod DNS records

https://gerrit.wikimedia.org/r/540019

Change 540019 merged by Elukey:
[operations/dns@master] Remove analytics1032's prod DNS records

https://gerrit.wikimedia.org/r/540019

elukey updated the task description. Oct 1 2019, 7:07 AM

Maintenance_bot removed a project: Patch-For-Review. Oct 1 2019, 7:10 AM

elukey@asw2-c-eqiad# show | compare
[edit interfaces interface-range disabled]
     member ge-7/0/34 { ... }
+    member ge-3/0/12;
[edit interfaces]
-   ge-3/0/12 {
-       description "analytics1032 - no-bw-mon";
-   }
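To make the change above easier to read, here is a rough sketch that groups the added and removed lines of the `show | compare` output by their `[edit ...]` path. The function name is hypothetical, and the parsing is based only on this one diff rather than on any formal description of the format.

```python
# The diff text is copied from the paste above, including the elided "{ ... }".
DIFF = """\
[edit interfaces interface-range disabled]
     member ge-7/0/34 { ... }
+    member ge-3/0/12;
[edit interfaces]
-   ge-3/0/12 {
-       description "analytics1032 - no-bw-mon";
-   }
"""


def summarize(diff):
    """Map each [edit ...] path to the lines added and removed under it."""
    changes = {}
    path = None
    for line in diff.splitlines():
        if line.startswith("[edit"):
            path = line.strip("[]")
            changes[path] = {"added": [], "removed": []}
        elif path is not None and line.startswith("+"):
            changes[path]["added"].append(line[1:].strip())
        elif path is not None and line.startswith("-"):
            changes[path]["removed"].append(line[1:].strip())
    return changes


for path, change in summarize(DIFF).items():
    print(path, "| added:", change["added"], "| removed:", change["removed"])
```

Read this way, the commit does two things: it adds ge-3/0/12 to the `disabled` interface-range, and it deletes the port's own stanza along with the analytics1032 description.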

elukey updated the task description. Oct 2 2019, 7:05 PM

elukey reassigned this task from RobH to • Cmjohnson. Oct 9 2019, 9:23 AM

MoritzMuehlenhoff moved this task from Backlog to pending onsite steps (eqiad) on the decommission-hardware board. Oct 14 2019, 3:15 PM

elukey mentioned this in T239045: analytics1057's BBU is faulty. Nov 26 2019, 7:29 AM

RobH unsubscribed. Mar 3 2020, 6:01 PM

RobH added a project: ops-eqiad. Apr 1 2020, 5:04 PM

RobH updated the task description.

RobH moved this task from Backlog to Decommission on the ops-eqiad board. Apr 1 2020, 5:51 PM

Change 597869 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/dns@master] Removing mgmt for the asset tag associated with analytics1032

https://gerrit.wikimedia.org/r/597869

gerritbot added a project: Patch-For-Review. May 21 2020, 9:41 PM

Change 597869 merged by Cmjohnson:
[operations/dns@master] Removing mgmt for the asset tag associated with analytics1032

https://gerrit.wikimedia.org/r/597869

Removed from rack and mgmt DNS removed; switch ports were already removed. Updated Netbox.

Maintenance_bot removed a project: Patch-For-Review. May 21 2020, 10:10 PM

Status	Assigned	Task
Resolved	None	T244211 Analytics Hardware for Fiscal Year 2019/2020
Resolved	Ottomata	T243521 Hadoop Hardware Orders FY2019-2020
Resolved	elukey	T255139 Create the new Hadoop test cluster
		Restricted Task
Resolved	elukey	T211836 Enable Security (stronger authentication and data encryption) for the Analytics Hadoop cluster and its dependent services
Resolved	• Cmjohnson	T227485 Decommission analytics10[28-31,33-41]
Resolved	• Cmjohnson	T233080 Decommission analytics1032

Decommission analytics1032
Closed, Resolved; Public