
decommission mobile1004 and mobile1005
Closed, Resolved (Public)

Description

mobile1004 and mobile1005 are still in the rack but are not being used, and their production DNS entries were removed some time ago. These servers have been out of warranty since 2014 and need to be decommissioned.

  • - all system services confirmed offline from production use
  • - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
  • - remove system from all lvs/pybal active configuration
  • - any service group puppet/hiera/dsh config removed
  • - remove site.pp entry (replace with role::spare if system isn't shut down immediately during this process)

START NON-INTERRUPTIBLE STEPS

  • - disable puppet on host
  • - remove all remaining puppet references (including role::spare)
  • - power down host
  • - disable switch port
  • - switch port assignment noted on this task (for later removal)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate, salt key removed

END NON-INTERRUPTIBLE STEPS
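
For reference, the puppet and salt cleanup in the steps above can be sketched as a dry-run helper that only prints the commands in checklist order; it does not execute anything. The function name `cleanup_commands` and the domain `example.org` are hypothetical placeholders (this task does not state the hosts' FQDNs), and where each command is run (on the host, the puppet master, or the salt master) is an assumption noted in the comments.

```python
from typing import List


def cleanup_commands(host: str, domain: str) -> List[str]:
    """Return the puppet/salt cleanup commands for one host, in checklist order.

    Dry run only: the commands are returned as strings, never executed.
    """
    fqdn = f"{host}.{domain}"
    return [
        # Assumed to run on the host itself, before it is powered down.
        f'puppet agent --disable "decommission {host}"',
        # Assumed to run on the puppet master once the host is powered down.
        f"puppet node clean {fqdn}",
        f"puppet node deactivate {fqdn}",
        # Assumed to run on the salt master; -d deletes the minion key.
        f"salt-key -d {fqdn}",
    ]


if __name__ == "__main__":
    # "example.org" is a placeholder domain, not the real one for these hosts.
    for host in ("mobile1004", "mobile1005"):
        for cmd in cleanup_commands(host, "example.org"):
            print(cmd)
```

Printing the commands first keeps the order reviewable before anything is run by hand; power down, switch port, and DNS steps are left out because they are not single commands on these masters.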

  • - system disks wiped (by onsite)
  • - IF DECOM: system unracked and decommissioned (by onsite), update racktables with result
  • - IF DECOM: switch port configuration removed from switch once system is unracked.
  • - IF DECOM: mgmt dns entries removed.
  • - IF RECLAIM: system added back to spares tracking (by onsite)

Event Timeline

Cmjohnson moved this task from Backlog to Decommission on the ops-eqiad board.

Change 429855 had a related patch set uploaded (by Cmjohnson; owner: Cmjohnson):
[operations/dns@master] Removing dns for mobile1004/1005

https://gerrit.wikimedia.org/r/429855

Change 429855 merged by Cmjohnson:
[operations/dns@master] Removing dns for mobile1004/1005

https://gerrit.wikimedia.org/r/429855

still needs port descriptions updated.

Cmjohnson updated the task description.