Page MenuHomePhabricator

Upgrade m5 to Bullseye
Closed, ResolvedPublic

Description

  • db2135
  • db2078
  • db1132
  • db1117
  • Reimage and move db1107 from m3 to m5 so it can later become m5 master.
  • Promote db1107 to m5 master: T302190
  • Reimage and move db1132 somewhere else - to be moved to s1 for 10.6 testing: T301879#7759181

Event Timeline

Marostegui moved this task from Triage to Ready on the DBA board.

Change 762683 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] db1107: Disable notifications

https://gerrit.wikimedia.org/r/762683

Change 762683 merged by Marostegui:

[operations/puppet@production] db1107: Disable notifications

https://gerrit.wikimedia.org/r/762683

Change 762739 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] db2134: Disable notifications

https://gerrit.wikimedia.org/r/762739

Change 762739 merged by Marostegui:

[operations/puppet@production] db2134: Disable notifications

https://gerrit.wikimedia.org/r/762739

Change 762741 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] db2134: Disable notifications

https://gerrit.wikimedia.org/r/762741

Change 762741 merged by Marostegui:

[operations/puppet@production] db2134: Disable notifications

https://gerrit.wikimedia.org/r/762741

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1001 for host db2135.codfw.wmnet with OS bullseye

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1001 for host db2135.codfw.wmnet with OS bullseye completed:

  • db2135 (PASS)
    • Downtimed on Icinga
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202202150814_marostegui_8983_db2135.out
    • Checked BIOS boot parameters are back to normal
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB

db1107 is no longer m3 master - let's wait 24 after the switch to move it to m5

Change 764111 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] db1107: Move from m3 to m5

https://gerrit.wikimedia.org/r/764111

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1001 for host db1107.eqiad.wmnet with OS bullseye

Change 764111 merged by Marostegui:

[operations/puppet@production] db1107: Move from m3 to m5

https://gerrit.wikimedia.org/r/764111

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1001 for host db1107.eqiad.wmnet with OS bullseye completed:

  • db1107 (WARN)
    • Downtimed on Icinga
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202202210620_marostegui_2167_db1107.out
    • Checked BIOS boot parameters are back to normal
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is not optimal, downtime not removed
    • Updated Netbox data from PuppetDB
Marostegui updated the task description. (Show Details)

This was all done - tomorrow I will reimage db1132 to Bullseye and move it to s1 for 10.6 testing: T301879#7759181
Closing this task