
rename/reimage labnodepool1002.eqiad.wmnet as cloudservices1003.wikimedia.org
Closed, Resolved, Public

Description

labnodepool1002 is no longer needed in its original role, but it was purchased in 2017 and is still usable. We recently allocated two new spare-pool machines to the cloud team as cloudservices100[34]. Since this server will also work and is already allocated to the cloud team, we are simply going to rename it to cloudservices1003.

This will free up a previously approved allocation of a brand new spare system from T199727.

Since this decommissions the hostname of one host and reuses the same system, this is a combined (custom) checklist:

cloudservices1003:

  • - set all icinga checks for labnodepool1002 to maint mode/disabled
  • - remove ALL references to labnodepool1002 in operations/puppet repo
  • - shutdown the labnodepool1002 OS/server
  • - puppet node clean/deactivate labnodepool1002 on puppetmaster (this clears out monitoring of the old hostname)
  • - remove debmonitor entries on neodymium/sarin: sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/${HOST_FQDN} --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key
  • - change mgmt dns entries from labnodepool1002 to cloudservices1003
  • - remove/update/change labnodepool1002 to cloudservices1003 in production DNS
  • - hostname entry in racktables changed from labnodepool1002 to cloudservices1003
  • - physical hostname label changed from labnodepool1002 to cloudservices1003 - ONLY ONSITE CAN DO THIS
  • - physical label field in racktables changed from labnodepool1002 to cloudservices1003 (only after actual physical label step above is done) - ONLY ONSITE SHOULD DO THIS
  • - bios/drac/serial setup/testing, firmware updates. this should be done before OS reimage while system is offline.
  • - mgmt dns entries updated for hostname from labnodepool1002 to cloudservices1003
  • - network port setup (description, enable, vlan) and update description from labnodepool1002 to cloudservices1003 and change to public vlan
  • - production dns entries added (remove old entries for labnodepool1002)
  • - operations/puppet update (install_server at minimum, other files if possible)
  • - OS installation
  • - puppet accept/initial run
  • - handoff for service implementation
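
The debmonitor step above leaves `${HOST_FQDN}` unset, and the "remove ALL references" step is easy to leave incomplete. The following is a minimal sketch of how both could be checked before running anything. It assumes the old FQDN is labnodepool1002.eqiad.wmnet (from the task title) and that a local checkout of operations/puppet is available at a path you supply. The function names are hypothetical and not part of any existing tooling; the first helper only prints the command for review and does not contact debmonitor.

```shell
#!/bin/sh
# Hypothetical helpers for the rename checklist; names are illustrative only.

OLD_HOST="labnodepool1002"
OLD_FQDN="${OLD_HOST}.eqiad.wmnet"   # assumption: FQDN taken from the task title

# Print (do not run) the debmonitor removal command from the checklist with
# HOST_FQDN filled in, so it can be reviewed before being pasted on
# neodymium/sarin.
debmonitor_delete_cmd() {
    host_fqdn="$1"
    printf 'sudo curl -X DELETE https://debmonitor.discovery.wmnet/hosts/%s --cert /etc/debmonitor/ssl/cert.pem --key /etc/debmonitor/ssl/server.key\n' "$host_fqdn"
}

# List files under a repository checkout that still mention the old hostname.
# Exit status 0 means references remain; non-zero means none were found.
remaining_references() {
    repo_dir="$1"
    grep -rl --exclude-dir=.git -- "$OLD_HOST" "$repo_dir"
}

# Example usage (the checkout path is a placeholder):
#   debmonitor_delete_cmd "$OLD_FQDN"
#   remaining_references /path/to/operations-puppet
```

An empty result from `remaining_references` only shows that the literal string labnodepool1002 no longer appears in the checkout; regex-based node definitions would still need a manual look.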

Event Timeline

RobH renamed this task from Rename labnodepool1002.eqiad.wmnet as cloudservices1003.eqiad.wmnet to rename/reimage labnodepool1002.eqiad.wmnet as cloudservices1003.eqiad.wmnet. Aug 7 2018, 5:42 PM
RobH updated the task description.
Andrew renamed this task from rename/reimage labnodepool1002.eqiad.wmnet as cloudservices1003.eqiad.wmnet to rename/reimage labnodepool1002.eqiad.wmnet as cloudservices1003.wikimedia.org. Aug 7 2018, 7:17 PM

Change 451084 had a related patch set uploaded (by Andrew Bogott; owner: Andrew Bogott):
[operations/puppet@production] Rename labnodepool1002 to cloudservices1003

https://gerrit.wikimedia.org/r/451084

Mentioned in SAL (#wikimedia-operations) [2018-08-07T19:30:43Z] <andrewbogott> shutting down labnodepool1002 in advance of a rename. T201439

Change 451084 merged by Andrew Bogott:
[operations/puppet@production] Rename labnodepool1002 to cloudservices1003

https://gerrit.wikimedia.org/r/451084

Change 451090 had a related patch set uploaded (by Andrew Bogott; owner: Andrew Bogott):
[operations/dns@master] rename labnodepool1002.mgmt to cloudservices1003.mgmt

https://gerrit.wikimedia.org/r/451090

Change 451091 had a related patch set uploaded (by Andrew Bogott; owner: Andrew Bogott):
[operations/dns@master] Move labnodepool1002.eqiad.wmnet to cloudservices1003.wikimedia.org

https://gerrit.wikimedia.org/r/451091

Change 451090 merged by Andrew Bogott:
[operations/dns@master] rename labnodepool1002.mgmt to cloudservices1003.mgmt

https://gerrit.wikimedia.org/r/451090

Change 451091 merged by Andrew Bogott:
[operations/dns@master] Move labnodepool1002.eqiad.wmnet to cloudservices1003.wikimedia.org

https://gerrit.wikimedia.org/r/451091

Change 451100 had a related patch set uploaded (by Andrew Bogott; owner: Andrew Bogott):
[operations/puppet@production] cloudservices1003: try a different partman recipe

https://gerrit.wikimedia.org/r/451100

Change 451100 merged by Andrew Bogott:
[operations/puppet@production] cloudservices1003: try a different partman recipe

https://gerrit.wikimedia.org/r/451100

Andrew updated the task description.

This server is up and puppetized, with one remaining puppet error, which is tracked in T201473.

Handing this off to Chris for the DC-specific renames; then this can be closed.

This is all done except for the physical label changes in eqiad.

label changed, racktables updated