Page MenuHomePhabricator

Move maps servers to Bookworm
Open, Needs TriagePublic

Description

kartotherian is being moved to wikikube. When that is complete, the server backend will be moved to Bookworm.

We'll be reusing six former ganeti nodes a test cluster to test the new stack (setup at T380144) until we eventually reimage the maps* nodes.

The high level plan is:

  • install a master bookworm node and fix all issues
  • install a replica bookworm node and fix all issues
  • the remaing replicas
  • test the OSM import on the new cluster
  • point wikikube/staging to use the bookworm cluster
  • if this works fine, we can enable expiration events for bookworm and disable it for buster
  • move the prod pods to the new cluster and eventually
  • reimage the maps* from buster to bookworm
  • failover production traffic to the actual maps* cluster

Details

SubjectRepoBranchLines +/-
operations/puppetproduction+1 -1
operations/puppetproduction+5 -1
operations/puppetproduction+13 -10
operations/puppetproduction+10 -3
labs/privatemaster+4 -0
operations/puppetproduction+6 -2
operations/puppetproduction+15 -8
operations/puppetproduction+2 -2
operations/puppetproduction+1 -0
operations/puppetproduction+1 -1
labs/privatemaster+8 -0
operations/puppetproduction+2 -0
operations/puppetproduction+9 -1
operations/puppetproduction+61 -32
operations/puppetproduction+16 -0
operations/puppetproduction+53 -0
operations/puppetproduction+0 -8
operations/puppetproduction+3 -4
operations/puppetproduction+5 -10
operations/debs/osmbordermaster+8 -3
operations/puppetproduction+29 -15
operations/puppetproduction+0 -16
operations/puppetproduction+42 -44
operations/puppetproduction+2 -9
operations/puppetproduction+6 -0
operations/puppetproduction+4 -14
operations/puppetproduction+12 -11
operations/puppetproduction+3 -5
operations/puppetproduction+17 -271
Show related patches Customize query in gerrit

Event Timeline

Change #1100784 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] maps: Remove support for osm2pgsql as OSM engine

https://gerrit.wikimedia.org/r/1100784

Change #1100788 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] osm_master: Avoid Ferm-specific syntax

https://gerrit.wikimedia.org/r/1100788

Change #1100784 merged by Muehlenhoff:

[operations/puppet@production] maps: Remove support for osm2pgsql as OSM engine

https://gerrit.wikimedia.org/r/1100784

Change #1100788 merged by Muehlenhoff:

[operations/puppet@production] osm_master: Avoid Ferm-specific syntax

https://gerrit.wikimedia.org/r/1100788

Change #1100456 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] maps: Allow disabling the installation of kartotherian

https://gerrit.wikimedia.org/r/1100456

Change #1100456 merged by Muehlenhoff:

[operations/puppet@production] maps: Allow disabling the installation of kartotherian

https://gerrit.wikimedia.org/r/1100456

Change #1101465 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] maps/postgresql: Support bookworm

https://gerrit.wikimedia.org/r/1101465

Change #1101465 merged by Muehlenhoff:

[operations/puppet@production] maps/postgresql: Support bookworm

https://gerrit.wikimedia.org/r/1101465

Change #1101864 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Configure new maps nodes with nftables

https://gerrit.wikimedia.org/r/1101864

Change #1101864 merged by Muehlenhoff:

[operations/puppet@production] Configure new maps nodes with nftables

https://gerrit.wikimedia.org/r/1101864

Change #1105836 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] maps::master Add toggle for planet sync

https://gerrit.wikimedia.org/r/1105836

Change #1105836 abandoned by Muehlenhoff:

[operations/puppet@production] maps::master Add toggle for planet sync

Reason:

Functionality already exists, just confusingly named

https://gerrit.wikimedia.org/r/1105836

Change #1105875 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] planet_sync: Cleanup time handling

https://gerrit.wikimedia.org/r/1105875

Change #1105876 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] planet_sync: Remove obsolete options

https://gerrit.wikimedia.org/r/1105876

Mentioned in SAL (#wikimedia-operations) [2024-12-20T09:43:53Z] <moritzm> imported imposm3 0.11.1-1+deb12u1 to apt.wikimedia.org T381565

Change #1105875 merged by Muehlenhoff:

[operations/puppet@production] planet_sync: Cleanup time handling

https://gerrit.wikimedia.org/r/1105875

Change #1105876 merged by Muehlenhoff:

[operations/puppet@production] planet_sync: Remove obsolete options

https://gerrit.wikimedia.org/r/1105876

Change #1108745 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] osm: On Bookworm create OSM users using system::sysuser

https://gerrit.wikimedia.org/r/1108745

Change #1108745 merged by Muehlenhoff:

[operations/puppet@production] osm: On Bookworm create OSM users using system::sysuser

https://gerrit.wikimedia.org/r/1108745

Change #1108773 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] maps::osm_master: Inline osm class

https://gerrit.wikimedia.org/r/1108773

Change #1108776 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/debs/osmborder@master] osmborder: Build for Bookworm and bump debhelper compat to 12

https://gerrit.wikimedia.org/r/1108776

Change #1108776 merged by Muehlenhoff:

[operations/debs/osmborder@master] osmborder: Build for Bookworm and bump debhelper compat to 13

https://gerrit.wikimedia.org/r/1108776

Mentioned in SAL (#wikimedia-operations) [2025-01-08T10:04:41Z] <moritzm> imported osmborder 0.1.0+wmf12u1 to apt.wikimedia.org/bookworm T381565

Change #1108773 merged by Muehlenhoff:

[operations/puppet@production] maps::osm_master: Inline osm class

https://gerrit.wikimedia.org/r/1108773

Change #1111634 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Make maps-test2001 a bookworm maps master node (WIP)

https://gerrit.wikimedia.org/r/1111634

Change #1111639 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] postgresql: Add support for Bookworm

https://gerrit.wikimedia.org/r/1111639

Change #1111639 merged by Muehlenhoff:

[operations/puppet@production] postgresql: Add support for Bookworm

https://gerrit.wikimedia.org/r/1111639

Change #1111647 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Remove profile::java from maps hosts

https://gerrit.wikimedia.org/r/1111647

Change #1111647 merged by Muehlenhoff:

[operations/puppet@production] Remove profile::java from maps hosts

https://gerrit.wikimedia.org/r/1111647

Change #1111659 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Add separate maps master/replica roles for the new Bookworm setup (WIP)

https://gerrit.wikimedia.org/r/1111659

Change #1112167 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Don't setup database config for tilerator on bookworm

https://gerrit.wikimedia.org/r/1112167

Change #1111659 merged by Muehlenhoff:

[operations/puppet@production] Add separate maps master/replica roles for the new Bookworm setup

https://gerrit.wikimedia.org/r/1111659

Change #1112218 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Add missing Hiera settings for new bookworm master roles

https://gerrit.wikimedia.org/r/1112218

Change #1112218 merged by Muehlenhoff:

[operations/puppet@production] Add missing Hiera settings for new bookworm master roles

https://gerrit.wikimedia.org/r/1112218

Change #1112167 merged by Muehlenhoff:

[operations/puppet@production] Don't setup database config for tilerator on bookworm

https://gerrit.wikimedia.org/r/1112167

Change #1113442 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Manage tile invalidation with a separate Hiera setting

https://gerrit.wikimedia.org/r/1113442

Change #1113442 merged by Muehlenhoff:

[operations/puppet@production] Manage tile invalidation with a separate Hiera setting

https://gerrit.wikimedia.org/r/1113442

Change #1113487 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Extend comment

https://gerrit.wikimedia.org/r/1113487

Change #1113487 merged by Muehlenhoff:

[operations/puppet@production] Extend comment

https://gerrit.wikimedia.org/r/1113487

Change #1113740 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[labs/private@master] Add stub secrets for new master_bookworm roles

https://gerrit.wikimedia.org/r/1113740

Change #1113740 merged by Muehlenhoff:

[labs/private@master] Add stub secrets for new master_bookworm roles

https://gerrit.wikimedia.org/r/1113740

Change #1113746 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] profile::maps::osm_master: Make tilerator_pass optional

https://gerrit.wikimedia.org/r/1113746

Change #1113746 merged by Muehlenhoff:

[operations/puppet@production] profile::maps::osm_master: Make tilerator_pass optional

https://gerrit.wikimedia.org/r/1113746

Change #1113810 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] osm_master: Provide a dummy variable for tilerator on bookworm roles

https://gerrit.wikimedia.org/r/1113810

Change #1113810 merged by Muehlenhoff:

[operations/puppet@production] osm_master: Provide a dummy variable for tilerator on bookworm roles

https://gerrit.wikimedia.org/r/1113810

Change #1114755 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] maps_bookworm: Initially disable replication/tile gen timers

https://gerrit.wikimedia.org/r/1114755

Change #1114755 merged by Muehlenhoff:

[operations/puppet@production] maps_bookworm: Initially disable replication/tile gen timers

https://gerrit.wikimedia.org/r/1114755

Change #1114769 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Add a separate Hiera option to control the waterlines import

https://gerrit.wikimedia.org/r/1114769

Change #1114769 merged by Muehlenhoff:

[operations/puppet@production] maps: Add a separate Hiera option to control the waterlines import

https://gerrit.wikimedia.org/r/1114769

Change #1111634 merged by Muehlenhoff:

[operations/puppet@production] Make maps-test2001 a bookworm maps master node

https://gerrit.wikimedia.org/r/1111634

Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host maps-test2001.codfw.wmnet with OS bookworm

Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host maps-test2001.codfw.wmnet with OS bookworm executed with errors:

  • maps-test2001 (FAIL)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bookworm OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202501301309_jmm_4089887_maps-test2001.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • The reimage failed, see the cookbook logs for the details. You can also try typing "sudo install-console maps-test2001.codfw.wmnet" to get a root shell, but depending on the failure this may not work.

Cookbook cookbooks.sre.hosts.reimage was started by jmm@cumin2002 for host maps-test2001.codfw.wmnet with OS bookworm

Cookbook cookbooks.sre.hosts.reimage started by jmm@cumin2002 for host maps-test2001.codfw.wmnet with OS bookworm completed:

  • maps-test2001 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Disabled Puppet
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bookworm OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202501301558_jmm_4144943_maps-test2001.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB

Change #1115846 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[labs/private@master] Add stub secrets for bookworm replica role

https://gerrit.wikimedia.org/r/1115846

Change #1115846 merged by Muehlenhoff:

[labs/private@master] Add stub secrets for bookworm replica role

https://gerrit.wikimedia.org/r/1115846

Change #1115850 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Make maps-test2002 a bookworm maps replica

https://gerrit.wikimedia.org/r/1115850

Change #1115863 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Enable maps-test2003 to maps-test2006 as additional maps bookworm replicas

https://gerrit.wikimedia.org/r/1115863

Change #1115850 merged by Muehlenhoff:

[operations/puppet@production] Make maps-test2002 a bookworm maps replica

https://gerrit.wikimedia.org/r/1115850

Change #1127042 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] osm_master: Tighten replication slot name on Bookworm

https://gerrit.wikimedia.org/r/1127042

Change #1127042 merged by Muehlenhoff:

[operations/puppet@production] osm_master: Tighten replication slot name on Bookworm

https://gerrit.wikimedia.org/r/1127042

Change #1128891 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] osm_replica: Fix Hiera variable

https://gerrit.wikimedia.org/r/1128891