Migrate the URL downloaders to Bullseye
Closed, Resolved (Public)

Description

These currently use Buster and should be migrated to Bullseye.

Event Timeline

MoritzMuehlenhoff triaged this task as Medium priority.

Change 890003 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Add new VMs

https://gerrit.wikimedia.org/r/890003

Change 890003 merged by Muehlenhoff:

[operations/puppet@production] Add new VMs

https://gerrit.wikimedia.org/r/890003

Cookbook cookbooks.sre.ganeti.reimage was started by jmm@cumin2002 for host urldownloader2003.wikimedia.org with OS bullseye

Cookbook cookbooks.sre.ganeti.reimage started by jmm@cumin2002 for host urldownloader2003.wikimedia.org with OS bullseye completed:

  • urldownloader2003 (PASS)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via gnt-instance
    • Host up (Debian installer)
    • Set boot to disk
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/ganeti/reimage/202302241013_jmm_848980_urldownloader2003.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed

Cookbook cookbooks.sre.ganeti.reimage was started by jmm@cumin2002 for host urldownloader2004.wikimedia.org with OS bullseye

Cookbook cookbooks.sre.ganeti.reimage started by jmm@cumin2002 for host urldownloader2004.wikimedia.org with OS bullseye completed:

  • urldownloader2004 (PASS)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via gnt-instance
    • Host up (Debian installer)
    • Set boot to disk
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/ganeti/reimage/202302241113_jmm_861678_urldownloader2004.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed

Mentioned in SAL (#wikimedia-operations) [2023-02-24T15:11:56Z] <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "new url downloaders - jmm@cumin2002 - T329945"

Mentioned in SAL (#wikimedia-operations) [2023-02-24T15:21:20Z] <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "new url downloaders - jmm@cumin2002 - T329945"

Cookbook cookbooks.sre.ganeti.reimage was started by jmm@cumin2002 for host urldownloader1003.wikimedia.org with OS bullseye

Cookbook cookbooks.sre.ganeti.reimage started by jmm@cumin2002 for host urldownloader1003.wikimedia.org with OS bullseye executed with errors:

  • urldownloader1003 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via gnt-instance
    • Host up (Debian installer)
    • Set boot to disk
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.ganeti.reimage was started by jmm@cumin2002 for host urldownloader1003.wikimedia.org with OS bullseye

Cookbook cookbooks.sre.ganeti.reimage started by jmm@cumin2002 for host urldownloader1003.wikimedia.org with OS bullseye executed with errors:

  • urldownloader1003 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via gnt-instance
    • Host up (Debian installer)
    • Set boot to disk
    • The reimage failed, see the cookbook logs for the details

Change 896099 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Add urldownloader100[34] to site.pp

https://gerrit.wikimedia.org/r/896099

Change 896099 merged by Muehlenhoff:

[operations/puppet@production] Add urldownloader100[34] to site.pp

https://gerrit.wikimedia.org/r/896099

Cookbook cookbooks.sre.ganeti.reimage was started by jmm@cumin2002 for host urldownloader1004.wikimedia.org with OS bullseye

Cookbook cookbooks.sre.ganeti.reimage started by jmm@cumin2002 for host urldownloader1004.wikimedia.org with OS bullseye completed:

  • urldownloader1004 (PASS)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via gnt-instance
    • Host up (Debian installer)
    • Set boot to disk
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/ganeti/reimage/202303101107_jmm_503880_urldownloader1004.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed

Change 896325 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Apply url_downloader role to urldownloader2004

https://gerrit.wikimedia.org/r/896325

Change 896325 merged by Muehlenhoff:

[operations/puppet@production] Apply url_downloader role to urldownloader2004

https://gerrit.wikimedia.org/r/896325

Change 916512 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/deployment-charts@master] Add new url downloaders to ACLs

https://gerrit.wikimedia.org/r/916512

Change 916512 merged by jenkins-bot:

[operations/deployment-charts@master] Add new url downloaders to ACLs

https://gerrit.wikimedia.org/r/916512

Change 926421 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/dns@master] Point codfw URL downloader to new bullseye host

https://gerrit.wikimedia.org/r/926421

Change 926496 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Also apply url downloader role to remaining Bullseye hosts

https://gerrit.wikimedia.org/r/926496

Change 926496 merged by Muehlenhoff:

[operations/puppet@production] Also apply url downloader role to remaining Bullseye hosts

https://gerrit.wikimedia.org/r/926496

Change 926421 merged by Muehlenhoff:

[operations/dns@master] Point codfw URL downloader to new bullseye host

https://gerrit.wikimedia.org/r/926421

Change 932830 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/dns@master] Point codfw URL downloader to new bullseye host

https://gerrit.wikimedia.org/r/932830

Change 932830 merged by Muehlenhoff:

[operations/dns@master] Point codfw URL downloader to new bullseye host

https://gerrit.wikimedia.org/r/932830

Change 933441 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/dns@master] Point eqiad URL downloaders to bullseye host

https://gerrit.wikimedia.org/r/933441

Change 933441 merged by Muehlenhoff:

[operations/dns@master] Point eqiad URL downloaders to bullseye host

https://gerrit.wikimedia.org/r/933441

Change 933904 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Remove urldownloader role from old buster servers

https://gerrit.wikimedia.org/r/933904

Change 933904 merged by Muehlenhoff:

[operations/puppet@production] Remove urldownloader role from old buster servers

https://gerrit.wikimedia.org/r/933904

Icinga downtime and Alertmanager silence (ID=04703ca5-7468-4229-a4bf-5a47b58763e0) set by jmm@cumin2002 for 7 days, 0:00:00 on 2 host(s) and their services with reason: pending decom

urldownloader[2001-2002].wikimedia.org

Change 935713 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/deployment-charts@master] Remove old URL downloaders

https://gerrit.wikimedia.org/r/935713

cookbooks.sre.hosts.decommission executed by jmm@cumin2002 for hosts: urldownloader2001.wikimedia.org

  • urldownloader2001.wikimedia.org (PASS)
    • Downtimed host on Icinga/Alertmanager
    • Found Ganeti VM
    • VM shutdown
    • Started forced sync of VMs in Ganeti cluster codfw to Netbox
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB
    • VM removed
    • Started forced sync of VMs in Ganeti cluster codfw to Netbox

cookbooks.sre.hosts.decommission executed by jmm@cumin2002 for hosts: urldownloader2002.wikimedia.org

  • urldownloader2002.wikimedia.org (PASS)
    • Downtimed host on Icinga/Alertmanager
    • Found Ganeti VM
    • VM shutdown
    • Started forced sync of VMs in Ganeti cluster codfw to Netbox
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB
    • VM removed
    • Started forced sync of VMs in Ganeti cluster codfw to Netbox

cookbooks.sre.hosts.decommission executed by jmm@cumin2002 for hosts: urldownloader1002.wikimedia.org

  • urldownloader1002.wikimedia.org (PASS)
    • Downtimed host on Icinga/Alertmanager
    • Found Ganeti VM
    • VM shutdown
    • Started forced sync of VMs in Ganeti cluster eqiad to Netbox
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB
    • VM removed
    • Started forced sync of VMs in Ganeti cluster eqiad to Netbox

cookbooks.sre.hosts.decommission executed by jmm@cumin2002 for hosts: urldownloader1001.wikimedia.org

  • urldownloader1001.wikimedia.org (PASS)
    • Downtimed host on Icinga/Alertmanager
    • Found Ganeti VM
    • VM shutdown
    • Started forced sync of VMs in Ganeti cluster eqiad to Netbox
    • Removed from DebMonitor
    • Removed from Puppet master and PuppetDB
    • VM removed
    • Started forced sync of VMs in Ganeti cluster eqiad to Netbox

Change 935713 merged by jenkins-bot:

[operations/deployment-charts@master] Remove old URL downloaders

https://gerrit.wikimedia.org/r/935713

We're running new URL downloaders on Bullseye (urldownloader[12]00[34].wikimedia.org), and the old Buster systems have been decommissioned.
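
For anyone unfamiliar with the shorthand, the bracket pattern above covers four hosts. A minimal sketch of the expansion, assuming each `[...]` group is a plain character class (one character chosen per group); `expand_char_classes` is a hypothetical helper written for this note, not part of Cumin or Spicerack, and it does not handle range syntax such as `[2001-2002]`:

```python
import itertools
import re


def expand_char_classes(pattern: str) -> list[str]:
    """Expand every [abc] character class in pattern into all combinations."""
    # Splitting on a capturing group alternates literal text (even indexes)
    # with the contents of each character class (odd indexes).
    parts = re.split(r"\[([^\]]+)\]", pattern)
    choices = [
        list(part) if index % 2 else [part]
        for index, part in enumerate(parts)
    ]
    return ["".join(combo) for combo in itertools.product(*choices)]


hosts = expand_char_classes("urldownloader[12]00[34].wikimedia.org")
for host in hosts:
    print(host)
```

This should list urldownloader1003, urldownloader1004, urldownloader2003 and urldownloader2004, which matches the hosts reimaged in the timeline above.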