
Decommission esams ms-fe / ms-be
Closed, Resolved (Public)

Description

The esams machines used for Swift are old and no longer used for production purposes, so we should decommission them.


  • - all system services confirmed offline from production use
  • - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
  • - remove system from all lvs/pybal active configuration
  • - any service group puppet/hiera/dsh config removed
  • - remove from site.pp (replace with role::spare if the system isn't shut down immediately during this process)

START NON-INTERRUPTIBLE STEPS

  • - disable puppet on host
  • - remove all remaining puppet references (include role::spare)
  • - power down host
  • - disable switch port
  • - switch port assignment noted on this task (for later removal)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate

END NON-INTERRUPTIBLE STEPS

  • - system disks wiped (by onsite)
  • - IF DECOM: system unracked and decommissioned (by onsite), update racktables with result
  • - IF DECOM: switch port configuration removed from switch once system is unracked.
  • - IF DECOM: mgmt dns entries removed.
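
The puppet-related part of the non-interruptible steps above can be sketched as a dry-run plan. This is only an illustration: the `Step` type, the `non_interruptible_plan` function, the host FQDN, and the split between "host" and "puppetmaster" are assumptions and are not taken from this task. Only the `puppet agent --disable`, `puppet node clean` and `puppet node deactivate` commands correspond to checklist items. The script prints commands and never runs them.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    where: str    # hypothetical label: "host" or "puppetmaster"
    command: str  # command line to review; it is printed, not executed


def non_interruptible_plan(fqdn: str, task: str) -> list[Step]:
    """Return the puppet-related commands for one host, in checklist order."""
    return [
        # "disable puppet on host"
        Step("host", f"puppet agent --disable 'decom {task}'"),
        # "puppet node clean, puppet node deactivate"
        Step("puppetmaster", f"puppet node clean {fqdn}"),
        Step("puppetmaster", f"puppet node deactivate {fqdn}"),
    ]


if __name__ == "__main__":
    # The FQDN is an assumed example; substitute the real host name.
    for step in non_interruptible_plan("ms-fe3001.esams.wmnet", "T169518"):
        print(f"[{step.where}] {step.command}")
```

Powering down the host, disabling the switch port and removing DNS entries are left out on purpose, since they depend on site-specific tooling and access.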

Event Timeline

Change 362965 had a related patch set uploaded (by Filippo Giunchedi; owner: Filippo Giunchedi):
[operations/puppet@production] Decom swift cluster in esams

https://gerrit.wikimedia.org/r/362965

Change 362965 merged by Filippo Giunchedi:
[operations/puppet@production] Decom swift cluster in esams

https://gerrit.wikimedia.org/r/362965

Change 362968 had a related patch set uploaded (by Filippo Giunchedi; owner: Filippo Giunchedi):
[operations/dns@master] Decom ms-fe.svc.esams.wmnet

https://gerrit.wikimedia.org/r/362968

Change 362968 merged by Filippo Giunchedi:
[operations/dns@master] Decom ms-fe.svc.esams.wmnet

https://gerrit.wikimedia.org/r/362968

I've reimaged all ms-be / ms-fe hosts in esams and wiped the data disks on the former. All that is left is to wipe the OS disks when the time comes for decom.

Dzahn updated the task description.
Dzahn updated the task description.

Change 403215 had a related patch set uploaded (by Dzahn; owner: Dzahn):
[operations/puppet@production] decom esams swift machines, rm from puppet/dhcp

https://gerrit.wikimedia.org/r/403215

Change 403216 had a related patch set uploaded (by Dzahn; owner: Dzahn):
[operations/dns@master] decom esams swift machines, keep mgmt

https://gerrit.wikimedia.org/r/403216

Change 403215 merged by Dzahn:
[operations/puppet@production] decom esams swift machines, rm from puppet/dhcp

https://gerrit.wikimedia.org/r/403215

Mentioned in SAL (#wikimedia-operations) [2018-01-09T18:42:53Z] <mutante> ms-fe3002,ms-fe3001 - powering down, removing from puppet and icinga, ms-be* removing from puppet/icinga (T169518)

Change 403216 merged by Dzahn:
[operations/dns@master] decom esams swift machines, keep mgmt

https://gerrit.wikimedia.org/r/403216

Dzahn added subscribers: mark, Dzahn.

@mark @fgiunchedi They are shut down and removed from Icinga and DNS now. The only step I could not do was "disable switch port", due to lack of access. I copied the checkboxes from the decom template on the wikitech server lifecycle page.

Thanks a lot @Dzahn for taking care of this!

Change 547337 had a related patch set uploaded (by Papaul; owner: Papaul):
[operations/dns@master] DNS: Remove mgmt Dns for ms-be300[1-4] and ms-fe300[1-2]

https://gerrit.wikimedia.org/r/547337

Change 547337 merged by Papaul:
[operations/dns@master] DNS: Remove mgmt Dns for ms-be300[1-4] and ms-fe300[1-2]

https://gerrit.wikimedia.org/r/547337
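
The change title above uses bracket range shorthand (`ms-be300[1-4]`, `ms-fe300[1-2]`) for the affected hosts. A minimal sketch of expanding that shorthand into individual host names follows; the `expand_hosts` helper is hypothetical and handles only a single `[start-end]` numeric range per pattern.

```python
from __future__ import annotations

import re


def expand_hosts(pattern: str) -> list[str]:
    """Expand one numeric range such as 'ms-be300[1-4]' into host names."""
    match = re.fullmatch(r"(.*)\[(\d+)-(\d+)\](.*)", pattern)
    if match is None:
        # No range in the pattern: treat it as a single host name.
        return [pattern]
    prefix, start, end, suffix = match.groups()
    width = len(start)  # keep any zero padding used by the range start
    return [
        f"{prefix}{number:0{width}d}{suffix}"
        for number in range(int(start), int(end) + 1)
    ]


if __name__ == "__main__":
    for pattern in ("ms-be300[1-4]", "ms-fe300[1-2]"):
        print(pattern, "->", ", ".join(expand_hosts(pattern)))
```

The two patterns from the change title expand to six names in total, which gives a quick way to confirm that a change touches the intended set of machines.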

Papaul updated the task description.

Complete