⚓ T189727 decommission elastic1021

Subject	Repo	Branch	Lines +/-
decom elastic1021 prod dns	operations/dns	master	+0 -0
Removing dns for elastic1021	operations/dns	master	+2 -6
decom elastic1021	operations/puppet	production	+0 -11
elastic: decommission elastic1021	operations/puppet	production	+11 -5

		Status	Subtype	Assigned	Task
		Resolved		RobH	T188595 Memory test failure on elastic1021
		Resolved		• Cmjohnson	T189727 decommission elastic1021

RobH triaged this task as Medium priority.Mar 14 2018, 7:13 PM

RobH created this task.

So this system was already offline for memory testing, but I want to confirm with @Gehel we're good to start decommission, which includes wipe of data.

@Gehel: Can you confirm you don't need any information off this host? Additionally, please ensure no other configuration files expect it. I can take over after ' - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.'

Thanks!

RobH moved this task from Backlog to Reclaim (Spares/Decommission) on the hardware-requests board.Mar 14 2018, 7:15 PM

RobH mentioned this in T188595: Memory test failure on elastic1021.

Change 419702 had a related patch set uploaded (by Gehel; owner: Gehel):
[operations/puppet@production] elastic: decommission elastic1021

https://gerrit.wikimedia.org/r/419702

gerritbot added a project: Patch-For-Review.Mar 15 2018, 10:08 AM

Gehel updated the task description. (Show Details)Mar 15 2018, 10:08 AM

Preliminary decommissioning steps are done (pending the merge of https://gerrit.wikimedia.org/r/#/c/419702/). A few notes:

Since elastic1021 is down and does not want to restart, the services normally running on it have not been stopped / masked. Also, modifying the icinga checks would require hacking into puppetdb. The checks are silences and will be purged during the puppet node cleanup phase.

From a service perspective, that host is banned from the cluster and marked as inactive in conftool. Even if it comes back from the deads for whatever reason, it should not cause any trouble.

Gehel moved this task from Incoming to Needs review on the Discovery-Search (Current work) board.Mar 15 2018, 1:05 PM

Thakns @Gehel, I'll steal and proceed from here!

Change 419702 merged by Gehel:
[operations/puppet@production] elastic: decommission elastic1021

https://gerrit.wikimedia.org/r/419702

Since the host is down, I cannot power it on and disable puppet. I have disabled the switch port, so if it does power on it will be fine.

Change 419811 had a related patch set uploaded (by RobH; owner: RobH):
[operations/puppet@production] decom elastic1021

https://gerrit.wikimedia.org/r/419811

Change 419811 merged by RobH:
[operations/puppet@production] decom elastic1021

https://gerrit.wikimedia.org/r/419811

Change 419813 had a related patch set uploaded (by RobH; owner: RobH):
[operations/dns@master] decom elastic1021 prod dns

https://gerrit.wikimedia.org/r/419813

This is now ready for disk wipe and unracking. Please note that since the system won't power on, you'll need to move the disks to a working system to wipe. (Perhaps another system you are already doing disk wipes on.)

Thanks!

• Cmjohnson moved this task from Backlog to Up next on the ops-eqiad board.Mar 28 2018, 5:57 PM

Removing search backend team from this ticket, nothing left to do on our side.

Change 427420 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/dns@master] Removing dns for elastic1021

https://gerrit.wikimedia.org/r/427420

Change 427420 merged by Cmjohnson:
[operations/dns@master] Removing dns for elastic1021

https://gerrit.wikimedia.org/r/427420