ops-monitoring-bot (Operations Monitoring Bot)
User · Bot

Projects (3)

User Details

User Since: Aug 12 2016, 1:45 PM (512 w, 3 d)
Roles: Bot
Availability: Available
LDAP User: Unknown
MediaWiki User: Unknown

Bot managed by SRE for automated interaction with Phabricator from monitoring tools.

Recent Activity

Today

ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es2042.codfw.wmnet with OS trixie completed:

  • es2042 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606081449_marostegui_1980850_es2042.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 3:07 PM · DBA
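
The reimage comments in this feed point at log files named like `202606081449_marostegui_1980850_es2042.out`. The sketch below splits such a name into its parts. The field meanings are an assumption read off the visible pattern only (a `YYYYMMDDHHMM` timestamp, the operator's username, a numeric identifier, and the short hostname), not taken from Spicerack documentation; `parse_reimage_log_name` and the field names are hypothetical.

```python
import re
from datetime import datetime
from pathlib import PurePosixPath
from typing import NamedTuple


class ReimageLogName(NamedTuple):
    """Parts of a reimage log file name (field meanings are assumed)."""
    timestamp: datetime   # assumed: YYYYMMDDHHMM prefix
    user: str             # assumed: operator who started the cookbook
    numeric_id: int       # assumed: some numeric identifier; meaning unknown
    host: str             # short hostname, may contain hyphens


# Four underscore-separated fields followed by ".out".
_PATTERN = re.compile(r"^(\d{12})_([^_]+)_(\d+)_(.+)\.out$")


def parse_reimage_log_name(path: str) -> ReimageLogName:
    """Split the final path component into its four assumed fields."""
    name = PurePosixPath(path).name
    match = _PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognised log file name: {name!r}")
    stamp, user, numeric_id, host = match.groups()
    return ReimageLogName(
        timestamp=datetime.strptime(stamp, "%Y%m%d%H%M"),
        user=user,
        numeric_id=int(numeric_id),
        host=host,
    )


if __name__ == "__main__":
    print(parse_reimage_log_name(
        "/var/log/spicerack/sre/hosts/reimage/"
        "202606081449_marostegui_1980850_es2042.out"
    ))
```

Hostnames such as `kafka-main1010` contain hyphens but no underscores, so splitting on underscores keeps the host field intact for the names that appear here.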
ops-monitoring-bot added a comment to T427357: codfw: rack A4 maintenance.

Icinga downtime and Alertmanager silence (ID=6c545cca-39bb-4652-9cd8-5da4fc40f265) set by jynus@cumin2002 for 4:00:00 on 2 host(s) and their services with reason: Switchover db

db[2183-2184].codfw.wmnet
Mon, Jun 8, 3:03 PM · Infrastructure-Foundations, netops, Observability-Logging, Machine-Learning-Team, Traffic, ServiceOps new, Discovery-Search
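
The downtime comment above names its targets in a folded form, `db[2183-2184].codfw.wmnet`, and reports 2 hosts. A minimal sketch of expanding that notation into individual hostnames follows. It handles only the single `[start-end]` range seen here, not the full syntax of whatever tool produced it; `expand_host_range` is a hypothetical helper.

```python
import re
from typing import List

# prefix[start-end]suffix, with exactly one numeric range.
_RANGE = re.compile(
    r"^(?P<prefix>[^\[\]]*)\[(?P<start>\d+)-(?P<end>\d+)\](?P<suffix>[^\[\]]*)$"
)


def expand_host_range(expr: str) -> List[str]:
    """Expand 'db[2183-2184].codfw.wmnet' into one hostname per number.

    An expression without a bracketed range is returned unchanged as a
    single-element list.
    """
    match = _RANGE.match(expr)
    if match is None:
        return [expr]
    start, end = match["start"], match["end"]
    if int(end) < int(start):
        raise ValueError(f"descending range in {expr!r}")
    width = len(start)  # keep any zero padding used by the start value
    return [
        f"{match['prefix']}{n:0{width}d}{match['suffix']}"
        for n in range(int(start), int(end) + 1)
    ]


if __name__ == "__main__":
    for host in expand_host_range("db[2183-2184].codfw.wmnet"):
        print(host)
```

For the expression in the comment this yields `db2183.codfw.wmnet` and `db2184.codfw.wmnet`, which matches the "2 host(s)" count reported by the bot.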
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es2042.codfw.wmnet with OS trixie

Mon, Jun 8, 2:26 PM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Completed depooling of es2042 by marostegui@cumin1003: Upgrading es2042.codfw.wmnet

Mon, Jun 8, 2:25 PM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Upgrading es2042.codfw.wmnet

Mon, Jun 8, 2:25 PM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es1041.eqiad.wmnet with OS trixie completed:

  • es1041 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606081318_marostegui_1968582_es1041.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 1:35 PM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es1041.eqiad.wmnet with OS trixie

Mon, Jun 8, 12:57 PM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Completed depooling of es1041 by marostegui@cumin1003: Upgrading es1041.eqiad.wmnet

Mon, Jun 8, 12:56 PM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Upgrading es1041.eqiad.wmnet

Mon, Jun 8, 12:55 PM · DBA
ops-monitoring-bot added a comment to T354872: Re-IP Swift hosts to per-rack subnets in codfw rows A-D.

Cookbook cookbooks.sre.hosts.reimage started by mvernon@cumin2002 for host ms-be2063.codfw.wmnet with OS bullseye completed:

  • ms-be2063 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Host successfully migrated to the new VLAN
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606081219_mvernon_1095115_ms-be2063.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 12:40 PM · SRE-swift-storage, Infrastructure-Foundations, SRE
ops-monitoring-bot added a comment to T354872: Re-IP Swift hosts to per-rack subnets in codfw rows A-D.

Cookbook cookbooks.sre.hosts.reimage started by mvernon@cumin2002 for host ms-be2062.codfw.wmnet with OS bullseye completed:

  • ms-be2062 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Host successfully migrated to the new VLAN
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606081213_mvernon_1093975_ms-be2062.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 12:32 PM · SRE-swift-storage, Infrastructure-Foundations, SRE
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es2041.codfw.wmnet with OS trixie completed:

  • es2041 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606081209_marostegui_1957788_es2041.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 12:27 PM · DBA
ops-monitoring-bot added a comment to T354872: Re-IP Swift hosts to per-rack subnets in codfw rows A-D.

Cookbook cookbooks.sre.hosts.reimage was started by mvernon@cumin2002 for host ms-be2063.codfw.wmnet with OS bullseye

Mon, Jun 8, 11:50 AM · SRE-swift-storage, Infrastructure-Foundations, SRE
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es2041.codfw.wmnet with OS trixie

Mon, Jun 8, 11:47 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Completed depooling of es2041 by marostegui@cumin1003: Upgrading es2041.codfw.wmnet

Mon, Jun 8, 11:45 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Upgrading es2041.codfw.wmnet

Mon, Jun 8, 11:45 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Upgrading es2042.codfw.wmnet

Mon, Jun 8, 11:44 AM · DBA
ops-monitoring-bot added a comment to T354872: Re-IP Swift hosts to per-rack subnets in codfw rows A-D.

Cookbook cookbooks.sre.hosts.reimage was started by mvernon@cumin2002 for host ms-be2062.codfw.wmnet with OS bullseye

Mon, Jun 8, 11:42 AM · SRE-swift-storage, Infrastructure-Foundations, SRE
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es1042.eqiad.wmnet with OS trixie completed:

  • es1042 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606081039_marostegui_1894481_es1042.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 10:56 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es1042.eqiad.wmnet with OS trixie

Mon, Jun 8, 10:19 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Completed depooling of es1042 by marostegui@cumin1003: Upgrading es1042.eqiad.wmnet

Mon, Jun 8, 10:15 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Upgrading es1042.eqiad.wmnet

Mon, Jun 8, 10:13 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es2043.codfw.wmnet with OS trixie completed:

  • es2043 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606080929_marostegui_1846847_es2043.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 9:46 AM · DBA
ops-monitoring-bot added a comment to T428405: ProbeDown - gitlab1004:443 has failed probes (http_gitlab_wikimedia_org_ip4).

Host gitlab1004.wikimedia.org rebooted by jelto@cumin1003 with reason: gitlab restart is stuck

Mon, Jun 8, 9:17 AM · collaboration-services
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es2043.codfw.wmnet with OS trixie

Mon, Jun 8, 9:07 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Completed depooling of es2043 by marostegui@cumin1003: Upgrading es2043.codfw.wmnet

Mon, Jun 8, 9:06 AM · DBA
ops-monitoring-bot added a comment to T428386: Migrate es4 section to Debian Trixie.

Upgrading es2043.codfw.wmnet

Mon, Jun 8, 9:06 AM · DBA
ops-monitoring-bot added a comment to T423069: Migrate m2 to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host db1217.eqiad.wmnet with OS trixie completed:

  • db1217 (WARN)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606080819_marostegui_1813634_db1217.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is not optimal, downtime not removed
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 8:41 AM · DBA
ops-monitoring-bot added a comment to T415977: [wikireplicas] Create views for new wiki urwikisource.

Section s5: Wikis urwikisource set up on clouddb - fceratto@cumin1003

Mon, Jun 8, 8:03 AM · Data-Services, cloud-services-team
ops-monitoring-bot added a comment to T423069: Migrate m2 to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host db1217.eqiad.wmnet with OS trixie

Mon, Jun 8, 7:55 AM · DBA
ops-monitoring-bot added a comment to T415977: [wikireplicas] Create views for new wiki urwikisource.

Section s5: Wikis urwikisource redacted - fceratto@cumin1003

Mon, Jun 8, 7:53 AM · Data-Services, cloud-services-team
ops-monitoring-bot added a comment to T415977: [wikireplicas] Create views for new wiki urwikisource.

Section s5: Wikis urwikisource redacted - fceratto@cumin1003

Mon, Jun 8, 7:50 AM · Data-Services, cloud-services-team
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es1051.eqiad.wmnet with OS trixie completed:

  • es1051 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606080620_marostegui_1789103_es1051.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 6:36 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es1051.eqiad.wmnet with OS trixie

Mon, Jun 8, 5:58 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es2052.codfw.wmnet with OS trixie completed:

  • es2052 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606080539_marostegui_1786432_es2052.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Mon, Jun 8, 5:54 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Upgrading es1051.eqiad.wmnet

Mon, Jun 8, 5:33 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es2052.codfw.wmnet with OS trixie

Mon, Jun 8, 5:19 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Completed depooling of es2052 by marostegui@cumin1003: Upgrading es2052.codfw.wmnet

Mon, Jun 8, 5:18 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Upgrading es2052.codfw.wmnet

Mon, Jun 8, 5:18 AM · DBA

Fri, Jun 5

ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es1054.eqiad.wmnet with OS trixie completed:

  • es1054 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606050545_marostegui_1236596_es1054.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Fri, Jun 5, 6:01 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es1054.eqiad.wmnet with OS trixie

Fri, Jun 5, 5:22 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Completed depooling of es1054 by marostegui@cumin1003: Upgrading es1054.eqiad.wmnet

Fri, Jun 5, 5:21 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Upgrading es1054.eqiad.wmnet

Fri, Jun 5, 5:21 AM · DBA
ops-monitoring-bot added a comment to T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers.

Cookbook cookbooks.sre.hosts.reimage started by jasmine@cumin2002 for host kafka-main1010.eqiad.wmnet with OS trixie completed:

  • kafka-main1010 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606050139_jasmine_114239_kafka-main1010.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
Fri, Jun 5, 1:55 AM · Patch-For-Review, ServiceOps new, ServiceOps-Datastores
ops-monitoring-bot added a comment to T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers.

Cookbook cookbooks.sre.hosts.reimage was started by jasmine@cumin2002 for host kafka-main1010.eqiad.wmnet with OS trixie

Fri, Jun 5, 1:16 AM · Patch-For-Review, ServiceOps new, ServiceOps-Datastores
ops-monitoring-bot added a comment to T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers.

Cookbook cookbooks.sre.hosts.reimage started by jasmine@cumin2002 for host kafka-main1007.eqiad.wmnet with OS trixie completed:

  • kafka-main1007 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606050040_jasmine_100324_kafka-main1007.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
Fri, Jun 5, 12:56 AM · Patch-For-Review, ServiceOps new, ServiceOps-Datastores
ops-monitoring-bot added a comment to T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers.

Cookbook cookbooks.sre.hosts.reimage was started by jasmine@cumin2002 for host kafka-main1007.eqiad.wmnet with OS trixie

Fri, Jun 5, 12:17 AM · Patch-For-Review, ServiceOps new, ServiceOps-Datastores

Thu, Jun 4

ops-monitoring-bot added a comment to T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers.

Cookbook cookbooks.sre.hosts.reimage started by jasmine@cumin2002 for host kafka-main1006.eqiad.wmnet with OS trixie completed:

  • kafka-main1006 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606042340_jasmine_86465_kafka-main1006.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 11:57 PM · Patch-For-Review, ServiceOps new, ServiceOps-Datastores
ops-monitoring-bot added a comment to T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers.

Cookbook cookbooks.sre.hosts.reimage was started by jasmine@cumin2002 for host kafka-main1006.eqiad.wmnet with OS trixie

Thu, Jun 4, 11:20 PM · Patch-For-Review, ServiceOps new, ServiceOps-Datastores
ops-monitoring-bot added a comment to T427393: EQSIN: Setup VRRP on both routers for the new subnets.

Cookbook cookbooks.sre.hosts.reimage started by cmooney@cumin1003 for host cp5030.eqsin.wmnet with OS trixie completed:

  • cp5030 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Host successfully migrated to the new VLAN
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606041949_cmooney_1168445_cp5030.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 8:18 PM · Patch-For-Review, Infrastructure-Foundations, SRE, netops, ops-eqsin, DC-Ops
ops-monitoring-bot added a comment to T427393: EQSIN: Setup VRRP on both routers for the new subnets.

Cookbook cookbooks.sre.hosts.reimage was started by cmooney@cumin1003 for host cp5030.eqsin.wmnet with OS trixie

Thu, Jun 4, 7:08 PM · Patch-For-Review, Infrastructure-Foundations, SRE, netops, ops-eqsin, DC-Ops
ops-monitoring-bot added a comment to T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers.

Cookbook cookbooks.sre.hosts.reimage started by jayme@cumin1003 for host kafka-main2007.codfw.wmnet with OS trixie completed:

  • kafka-main2007 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606041524_jayme_1135974_kafka-main2007.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 3:41 PM · Patch-For-Review, ServiceOps new, ServiceOps-Datastores
ops-monitoring-bot added a comment to T426804: filerevision view should not filter out deleted file revisions.

Cookbook cookbooks.sre.wikireplicas.update-views started by ladsgroup completed:

  • an-redacteddb1001.eqiad.wmnet (PASS)
    • Ran Puppet agent
    • Ran 'maintain-views --replace --auto-depool --all-databases --table filerevision_userindex'
Thu, Jun 4, 3:40 PM · DBA, SecTeam-Processed, Privacy Engineering, cloud-services-team, Data-Services
ops-monitoring-bot added a comment to T426804: filerevision view should not filter out deleted file revisions.

Cookbook cookbooks.sre.wikireplicas.update-views run by ladsgroup: Started updating wiki replica views

Thu, Jun 4, 3:29 PM · DBA, SecTeam-Processed, Privacy Engineering, cloud-services-team, Data-Services
ops-monitoring-bot added a comment to T426804: filerevision view should not filter out deleted file revisions.

Cookbook cookbooks.sre.wikireplicas.update-views started by ladsgroup completed:

  • an-redacteddb1001.eqiad.wmnet (PASS)
    • Ran Puppet agent
    • Ran 'maintain-views --replace --auto-depool --all-databases --table filerevision'
Thu, Jun 4, 3:13 PM · DBA, SecTeam-Processed, Privacy Engineering, cloud-services-team, Data-Services
ops-monitoring-bot added a comment to T426804: filerevision view should not filter out deleted file revisions.

Cookbook cookbooks.sre.wikireplicas.update-views run by ladsgroup: Started updating wiki replica views

Thu, Jun 4, 3:05 PM · DBA, SecTeam-Processed, Privacy Engineering, cloud-services-team, Data-Services
ops-monitoring-bot added a comment to T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers.

Cookbook cookbooks.sre.hosts.reimage was started by jayme@cumin1003 for host kafka-main2007.codfw.wmnet with OS trixie

Thu, Jun 4, 2:56 PM · Patch-For-Review, ServiceOps new, ServiceOps-Datastores
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es1057.eqiad.wmnet with OS trixie completed:

  • es1057 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606041325_marostegui_1118960_es1057.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 1:41 PM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Migration of db1220.eqiad.wmnet completed

Thu, Jun 4, 1:13 PM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Completed pooling of db1220 by marostegui@cumin1003: Migration of db1220.eqiad.wmnet completed

Thu, Jun 4, 1:13 PM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es1057.eqiad.wmnet with OS trixie

Thu, Jun 4, 1:00 PM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Completed depooling of es1057 by marostegui@cumin1003: Upgrading es1057.eqiad.wmnet

Thu, Jun 4, 1:00 PM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Upgrading es1057.eqiad.wmnet

Thu, Jun 4, 12:59 PM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Starting pool of db1220 by marostegui@cumin1003: Migration of db1220.eqiad.wmnet completed

Thu, Jun 4, 12:28 PM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host db1220.eqiad.wmnet with OS trixie completed:

  • db1220 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606041204_marostegui_1097239_db1220.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 12:20 PM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host db1220.eqiad.wmnet with OS trixie

Thu, Jun 4, 11:42 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es2050.codfw.wmnet with OS trixie completed:

  • es2050 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606041123_marostegui_1091846_es2050.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 11:40 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Completed depooling of db1220 by marostegui@cumin1003: Upgrading db1220.eqiad.wmnet

Thu, Jun 4, 11:40 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Upgrading db1220.eqiad.wmnet

Thu, Jun 4, 11:37 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Migration of db1179.eqiad.wmnet completed

Thu, Jun 4, 11:32 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Completed pooling of db1179 by marostegui@cumin1003: Migration of db1179.eqiad.wmnet completed

Thu, Jun 4, 11:32 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es2050.codfw.wmnet with OS trixie

Thu, Jun 4, 11:00 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Completed depooling of es2050 by marostegui@cumin1003: Upgrading es2050.codfw.wmnet

Thu, Jun 4, 11:00 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Upgrading es2050.codfw.wmnet

Thu, Jun 4, 10:59 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Starting pool of db1179 by marostegui@cumin1003: Migration of db1179.eqiad.wmnet completed

Thu, Jun 4, 10:46 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host db1179.eqiad.wmnet with OS trixie completed:

  • db1179 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606041019_marostegui_1081578_db1179.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 10:38 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host es2057.codfw.wmnet with OS trixie completed:

  • es2057 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606040954_marostegui_1078552_es2057.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 10:11 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host db1179.eqiad.wmnet with OS trixie

Thu, Jun 4, 9:59 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Completed depooling of db1179 by marostegui@cumin1003: Upgrading db1179.eqiad.wmnet

Thu, Jun 4, 9:58 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Upgrading db1179.eqiad.wmnet

Thu, Jun 4, 9:57 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Migration of db1224.eqiad.wmnet completed

Thu, Jun 4, 9:39 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Completed pooling of db1224 by marostegui@cumin1003: Migration of db1224.eqiad.wmnet completed

Thu, Jun 4, 9:39 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host es2057.codfw.wmnet with OS trixie

Thu, Jun 4, 9:33 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Completed depooling of es2057 by marostegui@cumin1003: Upgrading es2057.codfw.wmnet

Thu, Jun 4, 9:32 AM · DBA
ops-monitoring-bot added a comment to T428050: Migrate es3 section to Debian Trixie.

Upgrading es2057.codfw.wmnet

Thu, Jun 4, 9:32 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Starting pool of db1224 by marostegui@cumin1003: Migration of db1224.eqiad.wmnet completed

Thu, Jun 4, 8:53 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host db1224.eqiad.wmnet with OS trixie completed:

  • db1224 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606040804_marostegui_1052418_db1224.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 8:21 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by marostegui@cumin1003 for host db1224.eqiad.wmnet with OS trixie

Thu, Jun 4, 7:43 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Completed depooling of db1224 by marostegui@cumin1003: Upgrading db1224.eqiad.wmnet

Thu, Jun 4, 7:42 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Upgrading db1224.eqiad.wmnet

Thu, Jun 4, 7:41 AM · DBA
ops-monitoring-bot added a comment to T426725: Migrate x3 section to Debian Trixie.

Migration of db1255.eqiad.wmnet completed

Thu, Jun 4, 7:39 AM · DBA
ops-monitoring-bot added a comment to T426725: Migrate x3 section to Debian Trixie.

Completed pooling of db1255 by cwilliams@cumin1003: Migration of db1255.eqiad.wmnet completed

Thu, Jun 4, 7:39 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Migration of db2191.codfw.wmnet completed

Thu, Jun 4, 7:24 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Completed pooling of db2191 by marostegui@cumin1003: Migration of db2191.codfw.wmnet completed

Thu, Jun 4, 7:24 AM · DBA
ops-monitoring-bot added a comment to T426725: Migrate x3 section to Debian Trixie.

Starting pool of db1255 by cwilliams@cumin1003: Migration of db1255.eqiad.wmnet completed

Thu, Jun 4, 6:53 AM · DBA
ops-monitoring-bot added a comment to T426725: Migrate x3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by cwilliams@cumin1003 for host db1255.eqiad.wmnet with OS trixie completed:

  • db1255 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606040635_cwilliams_1028532_db1255.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 6:51 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Starting pool of db2191 by marostegui@cumin1003: Migration of db2191.codfw.wmnet completed

Thu, Jun 4, 6:38 AM · DBA
ops-monitoring-bot added a comment to T427880: Migrate x1 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage started by marostegui@cumin1003 for host db2191.codfw.wmnet with OS trixie completed:

  • db2191 (WARN)
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh trixie OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202606040615_marostegui_1025774_db2191.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Skipping waiting for Icinga optimal status and not removing the downtime, --no-check-icinga was set
    • Updated Netbox data from PuppetDB
Thu, Jun 4, 6:32 AM · DBA
ops-monitoring-bot added a comment to T426725: Migrate x3 section to Debian Trixie.

Cookbook cookbooks.sre.hosts.reimage was started by cwilliams@cumin1003 for host db1255.eqiad.wmnet with OS trixie

Thu, Jun 4, 6:16 AM · DBA
ops-monitoring-bot added a comment to T426725: Migrate x3 section to Debian Trixie.

Completed depooling of db1255 by cwilliams@cumin1003: Upgrading db1255.eqiad.wmnet

Thu, Jun 4, 6:13 AM · DBA