This task tracks the decommission-hardware of graphite2002 and the return of the host to the spares pool (as it's less than 4 years old).
This checklist can be copied and pasted into Phabricator hardware request tasks for reclaiming systems to spares or decommissioning them.
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place. (handled by wmf-decommission-host)
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp entry (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS
- - disable puppet on host
- - power down host
- - update status in netbox (planned for spare)
- - disable switch port & update switch port description to the asset tag name
- - remove all remaining puppet references (including role::spare)
- - remove production dns entries
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - remove mgmt dns entries for the hostname, leave the asset tag as this is going to spares
- - system disks wiped (by onsite)
- - remove hostname label of 'graphite2002' as this is returning to spares pool
- - leave host in rack and cabled after disk wipe, as it's now a spare pool system.
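
For reference, the command-line parts of the non-interruptible steps above can be sketched as a small dry-run helper. This is only an illustration, not the wmf-decommission-host script: it prints the commands for review and runs nothing. The function name `print_decom_commands`, the placeholder FQDN, and the exact `puppet agent --disable` message are assumptions; the curl line is copied from the checklist.

```shell
#!/bin/sh
# Dry-run sketch: prints the commands named in the checklist, executes none of them.
# The default FQDN is a placeholder (assumption); set HOST_FQDN to the real name.
HOST_FQDN="${HOST_FQDN:-graphite2002.example.wmnet}"

# print_decom_commands is a hypothetical helper, not part of any WMF tooling.
print_decom_commands() {
    fqdn="$1"
    # Run on the host being decommissioned, before power down.
    echo "sudo puppet agent --disable 'decommission ${fqdn}'"
    # Run on the puppet master (assumed location) after the host is powered down.
    echo "sudo puppet node clean ${fqdn}"
    echo "sudo puppet node deactivate ${fqdn}"
    # Run on neodymium/sarin, as given in the checklist.
    echo "sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${fqdn} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key"
}

print_decom_commands "$HOST_FQDN"
```

The steps marked "handled by wmf-decommission-host" normally do not need to be run by hand; the printed commands are only useful when checking what that script is expected to cover.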