m1 needs a switchover.
We need to recloned db1176 to become a m1 replica and install 10.4 back (it is running mariadb 11 at the moment)
Description
Description
Details
Details
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
mariadb: Move db1176 to m1 | operations/puppet | production | +8 -2 | |
mariadb: Install MariaDB 11 on db1106 | operations/puppet | production | +2 -3 |
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Restricted Task | |||||
Restricted Task | |||||
Open | • Marostegui | T326116 Package and test MariaDB 11 | |||
Resolved | • Marostegui | T327762 Move db1176 to m1 |
Event Timeline
Comment Actions
Change 883133 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] mariadb: Install MariaDB 11 on db1106
Comment Actions
Change 883133 merged by Marostegui:
[operations/puppet@production] mariadb: Install MariaDB 11 on db1106
Comment Actions
Change 883136 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] mariadb: Move db1176 to m1
Comment Actions
Change 883136 merged by Marostegui:
[operations/puppet@production] mariadb: Move db1176 to m1
Comment Actions
Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1001 for host db1176.eqiad.wmnet with OS bullseye
Comment Actions
Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1001 for host db1176.eqiad.wmnet with OS bullseye completed:
- db1176 (WARN)
- Downtimed on Icinga/Alertmanager
- Disabled Puppet
- Removed from Puppet and PuppetDB if present
- Deleted any existing Puppet certificate
- Removed from Debmonitor if present
- Forced PXE for next reboot
- Host rebooted via IPMI
- Host up (Debian installer)
- Host up (new fresh bullseye OS)
- Generated Puppet certificate
- Signed new Puppet certificate
- Run Puppet in NOOP mode to populate exported resources in PuppetDB
- Found Nagios_host resource for this host in PuppetDB
- Downtimed the new host on Icinga/Alertmanager
- Removed previous downtime on Alertmanager (old OS)
- First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202301241054_marostegui_57917_db1176.out
- Checked BIOS boot parameters are back to normal
- configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
- Rebooted
- Automatic Puppet run was successful
- Forced a re-check of all Icinga services for the host
- Icinga status is not optimal, downtime not removed
- Updated Netbox data from PuppetDB