
Decommission osm-db200[12] and osm-web200[1234]
Closed, Resolved (Public)

Description

These hosts should be empty; see also T182033 and T182034.

This task tracks the decommission-hardware work for servers osm-db200[12] and osm-web200[1-4]. Please note that NONE of these systems was ever installed, and they do NOT have an OS on them. We cannot confirm that their previous OS was properly wiped, though, so their disks still need to be wiped.
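For anyone scripting against these hosts, the bracket shorthand used in the task title can be expanded into individual hostnames. This is a minimal Python sketch, not an existing tool: `expand_hosts` is a hypothetical helper, and it assumes at most one bracket group containing single characters or `a-b` ranges.

```python
import re


def expand_hosts(pattern: str) -> list[str]:
    """Expand one bracket group, e.g. 'osm-db200[12]' or 'osm-web200[1-4]'."""
    match = re.fullmatch(r"([^\[\]]*)\[([^\[\]]+)\]([^\[\]]*)", pattern)
    if match is None:
        return [pattern]  # no bracket group: treat as a single literal hostname
    prefix, body, suffix = match.groups()
    chars: list[str] = []
    i = 0
    while i < len(body):
        # 'a-b' is an inclusive range of single characters; anything else is literal.
        if i + 2 < len(body) and body[i + 1] == "-":
            chars.extend(chr(c) for c in range(ord(body[i]), ord(body[i + 2]) + 1))
            i += 3
        else:
            chars.append(body[i])
            i += 1
    return [f"{prefix}{c}{suffix}" for c in chars]


hosts = expand_hosts("osm-db200[12]") + expand_hosts("osm-web200[1-4]")
print(hosts)
# ['osm-db2001', 'osm-db2002', 'osm-web2001', 'osm-web2002', 'osm-web2003', 'osm-web2004']
```

Both spellings seen on this task, `[1234]` and `[1-4]`, expand to the same four hosts.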

There is no puppet to stop, and the hosts are not in puppetdb or monitoring, so this is a modified checklist.

The first 5 steps should be completed by the service owner that is returning the server to DC-ops (for reclaim to spare or decommissioning, depending on server configuration and age).

osm-db2001:

  • - systems never deployed, no icinga, no monitoring, nothing to remove in puppet repo, no production dns entries.

Steps for DC-Ops:

  • - update netbox status to Inventory (if decom)
  • - disable switch port - no switch port assigned on switch labels to disable
  • - switch port assignment noted on this task (for later removal) - no switch port assigned on switch labels to disable
  • - remove all remaining puppet references (include role::spare)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
  • - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)
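The curl command in the last step above takes the host FQDN as its only variable. As a rough sketch, the command line could be assembled per host like this. The URL and certificate paths are copied from the checklist; `debmonitor_delete_cmd` is a hypothetical helper, and the `codfw.wmnet` suffix in the example is an assumption, not something stated on this task. The sketch only prints the command and does not run it. Since these hosts were never installed, there is probably nothing to delete for them.

```python
import shlex

# Values copied from the checklist step.
DEBMONITOR_HOSTS_URL = "https://debmonitor.discovery.wmnet/hosts/"
CERT_PATH = "/etc/debmonitor/ssl/cert.pem"
KEY_PATH = "/etc/debmonitor/ssl/server.key"


def debmonitor_delete_cmd(host_fqdn: str) -> list[str]:
    """Build (but do not run) the curl DELETE command for one host."""
    return [
        "sudo", "curl", "-X", "DELETE",
        f"{DEBMONITOR_HOSTS_URL}{host_fqdn}",
        "--cert", CERT_PATH,
        "--key", KEY_PATH,
    ]


# Assumed FQDN, for illustration only.
cmd = debmonitor_delete_cmd("osm-db2001.codfw.wmnet")
print(shlex.join(cmd))
```

Building the command as an argument list and quoting it with `shlex.join` avoids shell-quoting surprises if the output is pasted into a terminal.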

End non-interrupt steps.

  • - system disks wiped (by onsite)
  • - IF DECOM: system unracked and decommissioned (by onsite), update Netbox with result
  • - IF DECOM: switch port configuration removed from switch once system is unracked.
  • - IF DECOM: add system to decommission tracking google sheet
  • - IF DECOM: mgmt dns entries removed.
  • - change netbox status to offline when unracked

osm-db2002:

  • - systems never deployed, no icinga, no monitoring, nothing to remove in puppet repo, no production dns entries.

Steps for DC-Ops:

  • - update netbox status to Inventory (if decom)
  • - disable switch port - no switch port assigned on switch labels to disable
  • - switch port assignment noted on this task (for later removal) - no switch port assigned on switch labels to disable
  • - remove all remaining puppet references (include role::spare)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
  • - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)

End non-interrupt steps.

  • - system disks wiped (by onsite)
  • - IF DECOM: system unracked and decommissioned (by onsite), update Netbox with result
  • - IF DECOM: switch port configuration removed from switch once system is unracked.
  • - IF DECOM: add system to decommission tracking google sheet
  • - IF DECOM: mgmt dns entries removed.
  • - change netbox status to offline when unracked

osm-web2001:

  • - systems never deployed, no icinga, no monitoring, nothing to remove in puppet repo, no production dns entries.

Steps for DC-Ops:

  • - update netbox status to Inventory (if decom)
  • - disable switch port - no switch port assigned on switch labels to disable
  • - switch port assignment noted on this task (for later removal) - no switch port assigned on switch labels to disable
  • - remove all remaining puppet references (include role::spare)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
  • - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)

End non-interrupt steps.

  • - system disks wiped (by onsite)
  • - IF DECOM: system unracked and decommissioned (by onsite), update Netbox with result
  • - IF DECOM: switch port configuration removed from switch once system is unracked.
  • - IF DECOM: add system to decommission tracking google sheet
  • - IF DECOM: mgmt dns entries removed.
  • - change netbox status to offline when unracked

osm-web2002:

  • - systems never deployed, no icinga, no monitoring, nothing to remove in puppet repo, no production dns entries.

Steps for DC-Ops:

  • - update netbox status to Inventory (if decom)
  • - disable switch port - no switch port assigned on switch labels to disable
  • - switch port assignment noted on this task (for later removal) - no switch port assigned on switch labels to disable
  • - remove all remaining puppet references (include role::spare)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
  • - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)

End non-interrupt steps.

  • - system disks wiped (by onsite)
  • - IF DECOM: system unracked and decommissioned (by onsite), update Netbox with result
  • - IF DECOM: switch port configuration removed from switch once system is unracked.
  • - IF DECOM: add system to decommission tracking google sheet
  • - IF DECOM: mgmt dns entries removed.
  • - change netbox status to offline when unracked

osm-web2003:

  • - systems never deployed, no icinga, no monitoring, nothing to remove in puppet repo, no production dns entries.

Steps for DC-Ops:

  • - update netbox status to Inventory (if decom)
  • - disable switch port - no switch port assigned on switch labels to disable
  • - switch port assignment noted on this task (for later removal) - no switch port assigned on switch labels to disable
  • - remove all remaining puppet references (include role::spare)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
  • - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)

End non-interrupt steps.

  • - system disks wiped (by onsite)
  • - IF DECOM: system unracked and decommissioned (by onsite), update Netbox with result
  • - IF DECOM: switch port configuration removed from switch once system is unracked.
  • - IF DECOM: add system to decommission tracking google sheet
  • - IF DECOM: mgmt dns entries removed.
  • - change netbox status to offline when unracked

osm-web2004:

  • - systems never deployed, no icinga, no monitoring, nothing to remove in puppet repo, no production dns entries.

Steps for DC-Ops:

  • - update netbox status to Inventory (if decom)
  • - disable switch port - no switch port assigned on switch labels to disable
  • - switch port assignment noted on this task (for later removal) - no switch port assigned on switch labels to disable
  • - remove all remaining puppet references (include role::spare)
  • - remove production dns entries
  • - puppet node clean, puppet node deactivate (handled by wmf-decommission-host)
  • - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key (handled by wmf-decommission-host)

End non-interrupt steps.

  • - system disks wiped (by onsite)
  • - IF DECOM: system unracked and decommissioned (by onsite), update Netbox with result
  • - IF DECOM: switch port configuration removed from switch once system is unracked.
  • - IF DECOM: add system to decommission tracking google sheet
  • - IF DECOM: mgmt dns entries removed.
  • - change netbox status to offline when unracked

Event Timeline

faidon created this task.
RobH raised the priority of this task from Low to Medium.
RobH updated the task description. (Show Details)
RobH added a project: decommission-hardware.
RobH moved this task from Backlog to pending onsite steps (codfw) on the decommission-hardware board.

osm-db2001

papaul@asw-a-codfw> show interfaces ge-5/0/2 descriptions    
Interface       Admin Link Description
ge-5/0/2        down  down DISABLED

osm-db2002

papaul@asw-b-codfw> show interfaces ge-5/0/2 descriptions 
Interface       Admin Link Description
ge-5/0/2        down  down DISABLED

osm-web2001

papaul@asw-a-codfw# run show interfaces ge-5/0/10 descriptions 
Interface       Admin Link Description
ge-5/0/10       down  down DISABLED

osm-web2002

papaul@asw-b-codfw> show interfaces ge-5/0/10 descriptions   
Interface       Admin Link Description
ge-5/0/10       down  down DISABLED

osm-web2003

papaul@asw-a-codfw> show interfaces ge-5/0/7 descriptions     
Interface       Admin Link Description
ge-5/0/7        down  down DISABLED

osm-web2004

papaul@asw-b-codfw> show interfaces ge-5/0/7 descriptions     
Interface       Admin Link Description
ge-5/0/7        down  down DISABLED
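The six outputs above all show the same thing: admin state down, link down, description DISABLED. A check like that could be automated along the following lines. This is only a sketch under the assumption that the output keeps the four-column layout pasted above; `parse_descriptions` and `is_disabled` are hypothetical helpers, not part of any switch tooling.

```python
def parse_descriptions(output: str) -> dict[str, dict[str, str]]:
    """Parse 'show interfaces ... descriptions' output into a per-port dict."""
    ports: dict[str, dict[str, str]] = {}
    for line in output.splitlines():
        fields = line.split(None, 3)
        # Skip the header row, CLI prompt lines, and blanks: only rows whose
        # Admin and Link columns are 'up' or 'down' are real interface rows.
        if len(fields) < 3 or fields[1] not in ("up", "down") or fields[2] not in ("up", "down"):
            continue
        description = fields[3].strip() if len(fields) > 3 else ""
        ports[fields[0]] = {"admin": fields[1], "link": fields[2], "description": description}
    return ports


def is_disabled(port: dict[str, str]) -> bool:
    """True when the port is admin down, link down, and labelled DISABLED."""
    return port["admin"] == "down" and port["link"] == "down" and port["description"] == "DISABLED"


# Sample copied from the osm-db2001 output on this task.
sample = """\
papaul@asw-a-codfw> show interfaces ge-5/0/2 descriptions
Interface       Admin Link Description
ge-5/0/2        down  down DISABLED
"""
ports = parse_descriptions(sample)
print(is_disabled(ports["ge-5/0/2"]))
```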

Change 508738 had a related patch set uploaded (by Papaul; owner: Papaul):
[operations/dns@master] DNS: Remove mgmt DNS for osm-db200[12] and osm-web200[1234]

https://gerrit.wikimedia.org/r/508738

Change 508738 abandoned by Papaul:
DNS: Remove mgmt DNS for osm-db200[12] and osm-web200[1234]

Reason:
mistake

https://gerrit.wikimedia.org/r/508738

Change 508823 had a related patch set uploaded (by Papaul; owner: Papaul):
[operations/dns@master] DNS: Remove mgmt DNS for osm-db200[12] and osm-web200[1234]

https://gerrit.wikimedia.org/r/508823

Change 508823 merged by Dzahn:
[operations/dns@master] DNS: Remove mgmt DNS for osm-db200[12] and osm-web200[1234]

https://gerrit.wikimedia.org/r/508823