Page MenuHomePhabricator

Upgrade ServiceOps roles from Bullseye to Debian Trixie
Open, HighPublic

Description

From OS report reference, those are the roles to migrate by Aug'2026:

  • role::redis::misc::master (4 host(s))
  • role::redis::misc::slave (4 host(s)) => should be done as part of T419976: Upgrade redis_misc hosts to Debian Trixie (Redis 8.0)
    • rdb1011.eqiad.wmnet
    • rdb1012.eqiad.wmnet
    • rdb1013.eqiad.wmnet
    • rdb1014.eqiad.wmnet
    • rdb2007.codfw.wmnet
    • rdb2008.codfw.wmnet
    • rdb2009.codfw.wmnet
    • rdb2010.codfw.wmnet
  • role::kafka::main (10 host(s))); should be done as part of T419216: Upgrade kafka-main to Kafka 3.x
    • kafka-main1006.eqiad.wmnet
    • kafka-main1007.eqiad.wmnet
    • kafka-main1008.eqiad.wmnet
    • kafka-main1009.eqiad.wmnet
    • kafka-main1010.eqiad.wmnet
    • kafka-main2006.codfw.wmnet
    • kafka-main2007.codfw.wmnet
    • kafka-main2008.codfw.wmnet
    • kafka-main2009.codfw.wmnet
    • kafka-main2010.codfw.wmnet
  • role::configcluster (6 host(s))
    • conf1007.eqiad.wmnet
    • conf1008.eqiad.wmnet
    • conf1009.eqiad.wmnet
    • conf2004.codfw.wmnet
    • conf2005.codfw.wmnet
    • conf2006.codfw.wmnet
  • role::parsoid::testing (1 host(s))
    • parsoidtest1001.eqiad.wmnet (hosts will be retired as part of T386246)
  • role::deployment_server::kubernetes (2 host(s))
    • deploy1003.eqiad.wmnet
    • deploy2002.codfw.wmnet

Event Timeline

@JMeybohm as discussed today:

So we'd be left with:

  • configcluster (conf*) hosts
  • deploy1003
  • deploy2002

@JMeybohm as discussed today:

Yes, we need to confirm compatibility between redis versions in Bullseye and Trixie and come up with a plan, but that's the goal.

@JMeybohm as discussed today:

Yes, we need to confirm compatibility between redis versions in Bullseye and Trixie and come up with a plan, but that's the goal.

IIRC Redis does not support skipping a major version, so we probably have to update to bullseye (redis 7) instead of trixie (redis 8).

For conf* hosts, I'd propose we start in codfw, by way of T418915: conf200[7-9] implementation tracking (i.e., conf200[4-6] go way). Once we're happy with that, we can proceed with migrating eqiad (a more complicated procedure, as it involves a proper etcd primary cluster switchover to codfw).

jijiki renamed this task from Migrate the Serviceops roles away from Bullseye to Upgrade ServiceOps roles to Debian Trixie.Mar 13 2026, 11:46 AM

@JMeybohm as discussed today:

Yes, we need to confirm compatibility between redis versions in Bullseye and Trixie and come up with a plan, but that's the goal.

IIRC Redis does not support skipping a major version, so we probably have to update to bullseye (redis 7) instead of trixie (redis 8).

Filed T419976 as a potential way forward, skipping upgrading to redis 7.

jijiki renamed this task from Upgrade ServiceOps roles to Debian Trixie to Upgrade ServiceOps roles from Bullseye to Debian Trixie.Mar 17 2026, 8:37 AM

My impression is that @Scott_French is taking the operations touching conf* hosts this quarter.

And for the deploy* hosts, given the dependency on ICU upgrade @Raine can you plan to do them?

@MLechvien-WMF - Yes, exactly. My plan is for the new conf* hosts being racked in codfw to run Trixie from day 1, which will pave the way for upgrading eqiad to Trixie as well (T419212#11703456). The actual process is going to be rather involved, and I'll need to spend a bit of time to iron out the procedure first.