Replace cloudnet100[34] with cloudnet100[56]
Closed, Resolved, Public

Description

These new hosts are racked but not yet puppetized or in service.

Event Timeline

Change 835657 had a related patch set uploaded (by Andrew Bogott; author: Andrew Bogott):

[operations/puppet@production] Make cloudnet100[56] into cloudnet nodes

https://gerrit.wikimedia.org/r/835657

We agreed with @cmooney and @ayounsi to convert them to a single-NIC setup before putting them into service.

Change 837631 had a related patch set uploaded (by Arturo Borrero Gonzalez; author: Arturo Borrero Gonzalez):

[operations/puppet@production] cloudnet1005/1006: prepare for single NIC setup

https://gerrit.wikimedia.org/r/837631

Change 837631 merged by Arturo Borrero Gonzalez:

[operations/puppet@production] cloudnet1005/1006: prepare for single NIC setup

https://gerrit.wikimedia.org/r/837631

Change 838117 had a related patch set uploaded (by Arturo Borrero Gonzalez; author: Arturo Borrero Gonzalez):

[operations/puppet@production] cloudnet1005/1006: give them proper role

https://gerrit.wikimedia.org/r/838117

Change 838117 merged by Arturo Borrero Gonzalez:

[operations/puppet@production] cloudnet1005/1006: give them proper role

https://gerrit.wikimedia.org/r/838117

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1005.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1005.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1005 (FAIL)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1005.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1005.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1005 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210041237_aborrero_2721337_cloudnet1005.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210041237_aborrero_2721275_cloudnet1006.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1005.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1005.eqiad.wmnet with OS bullseye completed:

  • cloudnet1005 (WARN)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210051133_aborrero_2971986_cloudnet1005.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is not optimal, downtime not removed
    • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye completed:

  • cloudnet1006 (FAIL)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210051133_aborrero_2971853_cloudnet1006.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is not optimal, downtime not removed
    • Updated Netbox data from PuppetDB
    • Failed to get Netbox script results, try manually: https://netbox.discovery.wmnet/api/extras/job-results/3824464/

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210051133_aborrero_2971853_cloudnet1006.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is not optimal, downtime not removed
    • Updated Netbox data from PuppetDB
    • Failed to get Netbox script results, try manually: https://netbox.discovery.wmnet/api/extras/job-results/3824464/
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Change 838793 had a related patch set uploaded (by Arturo Borrero Gonzalez; author: Arturo Borrero Gonzalez):

[operations/puppet@production] cloudnet1003: decom host

https://gerrit.wikimedia.org/r/838793

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye completed:

  • cloudnet1006 (WARN)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210051229_aborrero_2984550_cloudnet1006.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is not optimal, downtime not removed
    • Updated Netbox data from PuppetDB

Mentioned in SAL (#wikimedia-cloud) [2022-10-05T14:28:15Z] <arturo> adding cloudinstances2b-gw router to l3 agents on cloudnet1005/1006 (T316284)

For the record:

aborrero@cloudcontrol1005:~ $ sudo wmcs-openstack network agent list | grep 1006
| 2c9a8a8d-2ff4-47a9-80d4-645ac4c5ec50 | Linux bridge agent | cloudnet1006       | None              | :-)   | UP    | neutron-linuxbridge-agent |
| 3f54b3c2-503f-4667-8263-859a259b3b21 | L3 agent           | cloudnet1006       | nova              | :-)   | UP    | neutron-l3-agent          |
| 97b30d69-fd14-4061-a7df-601186626a3c | Metadata agent     | cloudnet1006       | None              | :-)   | UP    | neutron-metadata-agent    |
| e4f71e5d-e182-487d-8c5f-eb15f1ff2bf6 | DHCP agent         | cloudnet1006       | nova              | :-)   | UP    | neutron-dhcp-agent        |
aborrero@cloudcontrol1005:~ 2s $ sudo wmcs-openstack network agent list | grep 1005
| 29547916-33cd-45d8-b33c-4947921ba728 | Linux bridge agent | cloudnet1005       | None              | :-)   | UP    | neutron-linuxbridge-agent |
| 40a4af74-77fb-4341-8589-ae3e8a5fce5a | DHCP agent         | cloudnet1005       | nova              | :-)   | UP    | neutron-dhcp-agent        |
| 6a88c860-29fb-4a85-8aea-6a8877c2e035 | L3 agent           | cloudnet1005       | nova              | :-)   | UP    | neutron-l3-agent          |
| 723d796e-e702-43c9-a0bc-f9645e3ad7d0 | Metadata agent     | cloudnet1005       | None              | :-)   | UP    | neutron-metadata-agent    |
aborrero@cloudcontrol1005:~ 2s $ sudo wmcs-openstack router list
+--------------------------------------+---------------------+--------+-------+---------+-------------+------+
| ID                                   | Name                | Status | State | Project | Distributed | HA   |
+--------------------------------------+---------------------+--------+-------+---------+-------------+------+
| d93771ba-2711-4f88-804a-8df6fd03978a | cloudinstances2b-gw | ACTIVE | UP    | admin   | False       | True |
+--------------------------------------+---------------------+--------+-------+---------+-------------+------+
aborrero@cloudcontrol1005:~ 2s $ sudo wmcs-openstack network agent add router
usage: openstack network agent add router [-h] [--l3] <agent-id> <router>
openstack network agent add router: error: the following arguments are required: <agent-id>, <router>
aborrero@cloudcontrol1005:~ $ sudo wmcs-openstack network agent add router --l3 3f54b3c2-503f-4667-8263-859a259b3b21 d93771ba-2711-4f88-804a-8df6fd03978a
aborrero@cloudcontrol1005:~ 3s $ sudo wmcs-openstack network agent add router --l3 6a88c860-29fb-4a85-8aea-6a8877c2e035 d93771ba-2711-4f88-804a-8df6fd03978a
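The lookup in the transcript above (grep the agent list, copy the L3 agent ID by hand) could be scripted. This is a minimal sketch, assuming the table layout printed above stays the same; `l3_agent_ids` is a hypothetical helper name, not something shipped with `wmcs-openstack`.

```shell
# Hypothetical helper: read the table printed by
# `openstack network agent list` on stdin and print the ID of every
# "L3 agent" row that belongs to the host given as $1.
l3_agent_ids() {
    awk -F'|' -v host="$1" '
        {
            # Columns: $2 = ID, $3 = Agent Type, $4 = Host.
            id = $2; type = $3; h = $4
            # Strip the padding the table adds around each cell.
            gsub(/^ +| +$/, "", id)
            gsub(/^ +| +$/, "", type)
            gsub(/^ +| +$/, "", h)
            if (type == "L3 agent" && h == host) print id
        }'
}

# Possible usage on a cloudcontrol host (left commented out, it needs
# the real CLI and admin credentials):
#   router=d93771ba-2711-4f88-804a-8df6fd03978a
#   sudo wmcs-openstack network agent list | l3_agent_ids cloudnet1005 |
#       while IFS= read -r agent; do
#           sudo wmcs-openstack network agent add router --l3 "$agent" "$router"
#       done
```

Matching on the exact "L3 agent" cell rather than grepping for the host name avoids picking up the DHCP, metadata or Linux bridge agents that share the host.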

Icinga downtime and Alertmanager silence (ID=811efa94-5012-4cee-9c58-75a3650086e8) set by aborrero@cumin1001 for 1:00:00 on 1 host(s) and their services with reason: decom

cloudnet1003.eqiad.wmnet

Icinga downtime and Alertmanager silence (ID=a7a84472-3569-4c03-a2d9-0da483f24b3e) set by aborrero@cumin1001 for 1:00:00 on 1 host(s) and their services with reason: decom

cloudnet1004.eqiad.wmnet

Change 835657 abandoned by Andrew Bogott:

[operations/puppet@production] Make cloudnet100[56] into cloudnet nodes

Reason:

this was done elsewhere

https://gerrit.wikimedia.org/r/835657

Icinga downtime and Alertmanager silence (ID=83a14b71-79b5-4be2-ac1b-f073f85678b0) set by aborrero@cumin1001 for 15 days, 0:00:00 on 2 host(s) and their services with reason: migrating

cloudnet[1005-1006].eqiad.wmnet

@cmooney reported a problem with cloudnet bridges and vlan interfaces not being attached to them.

I confirmed this is true and spotted a potential cause:

aborrero@cloudnet1005:~ $ sudo systemctl status networking
● networking.service - Raise network interfaces
     Loaded: loaded (/lib/systemd/system/networking.service; enabled; vendor preset: enabled)
     Active: active (exited) since Thu 2022-10-06 09:36:27 UTC; 6min ago
       Docs: man:interfaces(5)
   Main PID: 1017 (code=exited, status=0/SUCCESS)
      Tasks: 0 (limit: 76785)
     Memory: 0B
        CPU: 0
     CGroup: /system.slice/networking.service

Oct 06 09:36:26 cloudnet1005 systemd[1]: Starting Raise network interfaces...
Oct 06 09:36:26 cloudnet1005 ifup[1046]: interface vlan1107 does not exist!
Oct 06 09:36:27 cloudnet1005 ifup[1098]: interface vlan1105 does not exist!
Oct 06 09:36:27 cloudnet1005 systemd[1]: Finished Raise network interfaces.

So apparently, when the system boots, the vlan interfaces are not added to the bridges because they do not exist yet at the time ifup runs?
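To check the same symptom on the other host, or after a reboot, the ifup complaints can be pulled out of the journal. This is a rough sketch that assumes the message format stays exactly as in the log above; `missing_ifaces` is a hypothetical helper name.

```shell
# Hypothetical helper: read journal/syslog lines on stdin and print the
# name of every interface that ifup reported as "does not exist!".
missing_ifaces() {
    sed -n 's/.*ifup\[[0-9]*\]: interface \([^ ]*\) does not exist!.*/\1/p'
}

# Possible usage on a cloudnet host (left commented out, it needs
# systemd's journal for the current boot):
#   sudo journalctl -u networking -b --no-pager | missing_ifaces
```

An empty result only means ifup did not log this particular message during that boot; it does not by itself prove that the vlan interfaces ended up attached to the bridges.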

Mentioned in SAL (#wikimedia-cloud) [2022-10-06T11:50:31Z] <arturo> set neutron l3 agents on cloudnet1005/1006 as down root@cloudcontrol1005:~# neutron agent-update --admin-state-down <uuid> (T316284)
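When several agents need to be set down, it can help to print the commands for review before running them. This is a minimal dry-run sketch built around the command quoted in the SAL entry above; `admin_down_cmds` is a hypothetical helper name, and the UUIDs fed to it are placeholders, since the log does not record which ones were used.

```shell
# Hypothetical helper: read agent UUIDs on stdin (one per line) and
# print, without executing anything, the agent-update command for each.
admin_down_cmds() {
    while IFS= read -r uuid; do
        # Skip blank lines so they do not produce a command with no UUID.
        [ -n "$uuid" ] || continue
        printf 'neutron agent-update --admin-state-down %s\n' "$uuid"
    done
}

# Possible usage, with placeholder values:
#   printf '%s\n' '<uuid-of-first-agent>' '<uuid-of-second-agent>' | admin_down_cmds
```

Printing instead of executing keeps the step reviewable; the output can be pasted into a root shell on the cloudcontrol host once it looks right.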

Mentioned in SAL (#wikimedia-cloud) [2022-10-06T11:54:14Z] <arturo> rebooting cloudnet1005/1006 to see if they have the right network config (T316284)

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye completed:

  • cloudnet1006 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run failed, asking the operator what to do
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210061316_aborrero_3257462_cloudnet1006.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1006.eqiad.wmnet with OS bullseye executed with errors:

  • cloudnet1006 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run failed, asking the operator what to do
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210061316_aborrero_3257462_cloudnet1006.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by aborrero@cumin1001 for host cloudnet1005.eqiad.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by aborrero@cumin1001 for host cloudnet1005.eqiad.wmnet with OS bullseye completed:

  • cloudnet1005 (WARN)
    • Downtimed on Icinga/Alertmanager
    • Unable to disable Puppet, the host may have been unreachable
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202210061445_aborrero_3276126_cloudnet1005.out
    • Checked BIOS boot parameters are back to normal
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB

Mentioned in SAL (#wikimedia-cloud) [2022-10-06T15:55:51Z] <arturo> cloudnet1005 & cloudnet1006 now in service. Decom cloudnet1003 & cloudnet1004. Drop neutron agents, etc. (T316284)

Change 841478 had a related patch set uploaded (by Arturo Borrero Gonzalez; author: Arturo Borrero Gonzalez):

[operations/puppet@production] cloudnet: merge host hiera overrides back into the profile

https://gerrit.wikimedia.org/r/841478

Change 841478 merged by Arturo Borrero Gonzalez:

[operations/puppet@production] cloudnet: merge host hiera overrides back into the profile

https://gerrit.wikimedia.org/r/841478

Change 838793 abandoned by Arturo Borrero Gonzalez:

[operations/puppet@production] cloudnet1003: decom host

Reason:

no longer relevant

https://gerrit.wikimedia.org/r/838793