cloudcontrol1006: move to new network setup
Closed, Resolved (Public)

Description

The cloudcontrol1006 server should move to a new network setup.

We should:

  • drop the wikimedia.org domain in favor of .eqiad.wmnet
  • drop the connection to asw
  • add a private.eqiad.wikimedia.cloud address

Following the procedure at https://wikitech.wikimedia.org/wiki/Server_Lifecycle#Move_existing_server_between_rows/racks,_changing_IPs
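
The renaming part of the plan above can be sketched as a small helper. This is only an illustration, not part of the procedure: the function name `planned_names` is hypothetical, and the form of the cloud-private name (short hostname prefixed to private.eqiad.wikimedia.cloud) is an assumption, since the task only names the domain.

```python
# Hypothetical helper: derives the names a host would have after the move.
OLD_DOMAIN = "wikimedia.org"
NEW_DOMAIN = "eqiad.wmnet"
# Assumption: the cloud-private record is <short hostname>.<this domain>.
CLOUD_PRIVATE_DOMAIN = "private.eqiad.wikimedia.cloud"


def planned_names(old_fqdn: str) -> dict[str, str]:
    """Return the old, new production, and assumed cloud-private names."""
    short, _, domain = old_fqdn.partition(".")
    if not short or domain != OLD_DOMAIN:
        raise ValueError(f"expected a host under {OLD_DOMAIN}, got {old_fqdn!r}")
    return {
        "old": old_fqdn,
        "production": f"{short}.{NEW_DOMAIN}",
        "cloud_private": f"{short}.{CLOUD_PRIVATE_DOMAIN}",
    }


if __name__ == "__main__":
    for label, name in planned_names("cloudcontrol1006.wikimedia.org").items():
        print(f"{label}: {name}")
```

The helper rejects any name that is not under wikimedia.org, so a host that has already been moved is not renamed a second time.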

Event Timeline

This is the active `maintain-dbusers` server at the moment. Moving it requires updating the database grants and firewall rules.

Change 961348 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: move Galera primary to cloudcontrol1007

https://gerrit.wikimedia.org/r/961348

Change 961349 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: move prometheus-openstack-exporter to cloudcontrol1007

https://gerrit.wikimedia.org/r/961349

Change 961348 merged by Majavah:

[operations/puppet@production] hieradata: move Galera primary to cloudcontrol1007

https://gerrit.wikimedia.org/r/961348

Change 961349 merged by Majavah:

[operations/puppet@production] hieradata: move prometheus-openstack-exporter to cloudcontrol1007

https://gerrit.wikimedia.org/r/961349

Change 961442 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] Take cloudcontrol1006 out of service

https://gerrit.wikimedia.org/r/961442

Change 961442 merged by Majavah:

[operations/puppet@production] Take cloudcontrol1006 out of service

https://gerrit.wikimedia.org/r/961442

cookbooks.sre.hosts.decommission executed by taavi@cumin1001 for hosts: cloudcontrol1006.wikimedia.org

  • cloudcontrol1006.wikimedia.org (PASS)
    • Downtimed host on Icinga/Alertmanager
    • Found physical host
    • Downtimed management interface on Alertmanager
    • Wiped all swraid, partition-table and filesystem signatures
    • Powered off
    • [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
    • Configured the linked switch interface(s)
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB

taavi updated the task description.

taavi subscribed.

Hi, this host is ready to be moved. Thanks!

@taavi Moved the server. Cable ID #5310, port 46.

Change 962355 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] site: Put cloudcontrol1006 back into service

https://gerrit.wikimedia.org/r/962355

Cookbook cookbooks.sre.hosts.reimage was started by taavi@cumin1001 for host cloudcontrol1006.eqiad.wmnet with OS bullseye

Change 962355 merged by Majavah:

[operations/puppet@production] site: Put cloudcontrol1006 back into service

https://gerrit.wikimedia.org/r/962355

Cookbook cookbooks.sre.hosts.reimage started by taavi@cumin1001 for host cloudcontrol1006.eqiad.wmnet with OS bullseye completed:

  • cloudcontrol1006 (PASS)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202310020834_taavi_3922148_cloudcontrol1006.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB