Please note these servers are due back for lease return in December 2018, so decommissioning them is high priority.
Once elastic2037-2054 (T210450) are configured, we can start removing those old servers. See Server Lifecycle for details.
Steps:
- ban the servers from the cluster
- wait for all shards to relocate
- follow the steps in Server Lifecycle (including adding a checklist for each server in this task).
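The first two steps can be sketched roughly as below. This is a minimal sketch, not the procedure actually used here: it assumes the nodes are banned through Elasticsearch's cluster.routing.allocation.exclude._name setting, that node names equal the short host names, and that the cluster is reachable at the hypothetical ES_URL. The function names are made up for illustration.

```shell
#!/bin/bash
# Hypothetical endpoint; the real cluster URL is not stated in this task.
ES_URL="${ES_URL:-http://localhost:9200}"

# Build the settings payload that excludes the given nodes from shard
# allocation. Assumes node names match the host names passed in.
exclude_payload() {
    local IFS=,
    printf '{"transient":{"cluster.routing.allocation.exclude._name":"%s"}}\n' "$*"
}

# Ban the given nodes; Elasticsearch then relocates their shards elsewhere.
ban_nodes() {
    curl -s -X PUT "${ES_URL}/_cluster/settings" \
        -H 'Content-Type: application/json' \
        -d "$(exclude_payload "$@")"
}

# Read "_cat/shards?h=node" output on stdin and print how many shards
# are still located on (or relocating away from) the named node.
count_shards_on_node() {
    awk -v n="$1" '$1 == n { c++ } END { print c + 0 }'
}

# Shards remaining on one node, fetched from the live cluster.
shards_left() {
    curl -s "${ES_URL}/_cat/shards?h=node" | count_shards_on_node "$1"
}
```

For example, `ban_nodes elastic2001 elastic2002` followed by repeated `shards_left elastic2001` calls until the count reaches 0 would cover "ban" and "wait for all shards to relocate" for those two hosts.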
elastic2001
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
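For the SSD wipe step in the checklist above, a common way to erase an SSD with hdparm is ATA Secure Erase. The sketch below only prints the usual command sequence rather than running it; the function name, the device path, and the temporary password are hypothetical, and this is not necessarily the exact procedure onsite follows.

```shell
#!/bin/bash
# Print (do not execute) the usual ATA Secure Erase sequence for one drive.
# $1: device path, $2: temporary security password (hypothetical default).
secure_erase_plan() {
    local dev="$1" pass="${2:-wipe}"
    # Inspect the drive first; its security section must report "not frozen".
    echo "hdparm -I ${dev}"
    # Set a temporary user password, which enables the security feature set.
    echo "hdparm --user-master u --security-set-pass ${pass} ${dev}"
    # Issue the erase; the password is cleared once the erase completes.
    echo "hdparm --user-master u --security-erase ${pass} ${dev}"
}
```

Running `secure_erase_plan /dev/sdX` lists the three commands so they can be reviewed before anyone runs them by hand against the right device.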
elastic2002
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2003
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2004
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2005
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2006
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2007
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2008
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2009
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2010
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2011
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2012
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2013
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2014
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2015
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2016
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2017
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2018
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2019
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2020
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2021
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2022
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2023
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.
elastic2024
Decommission Checklist
- - all system services confirmed offline from production use
- - set all icinga checks to maint mode/disabled while reclaim/decommission takes place.
- - remove system from all lvs/pybal active configuration
- - any service group puppet/hiera/dsh config removed
- - remove site.pp (replace with role(spare::system) if system isn't shut down immediately during this process.)
START NON-INTERRUPTIBLE STEPS - please assign to @RobH for the non-interrupt steps
- - disable puppet on host
- - power down host
- - update netbox status to Inventory (if decom) or Planned (if spare)
- - disable switch port
- - switch port assignment noted on this task (for later removal)
- - remove all remaining puppet references (include role::spare) https://gerrit.wikimedia.org/r/478105
- - remove production dns entries https://gerrit.wikimedia.org/r/478106
- - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
- - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
END NON-INTERRUPTIBLE STEPS
- - system disks wiped (by onsite) - Please note these are SSD systems, and must be wiped using the hdparm utility.
- - system unracked and decommissioned (by onsite), update netbox with result
- - switch port configuration removed from switch once system is unracked.
- - add system to decommission tracking google sheet
- - mgmt dns entries removed.
- - update @RobH when all elastic are done so we can move forward with lease return.