Umbrella task for the upgrade of the kafka-main clusters to Kafka 3.7.
Additionally: preforming a vlan migration for select broker hosts in eqiad, and a Debian Trixie upgrade for all hosts in both clusters in T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers
Items (summarized):
kafka-main has 2 clusters, main-codfw and main-eqiad. We'll upgrade codfw first then move onto eqiad. We'll then upgrade the inter broker protocol for all brokers sequentially. Following, the upgrade, we'll also upgrade all hosts to Debian Trixie in T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers
- kafka-main codfw:
- Pin the inter broker protocol version on the brokers to hieradata/role/common/kafka/main.yaml:profile::kafka::broker::inter_broker_protocol_version: 1.1.0
- Perform a rolling upgrade of the brokers, that will restart with the pinned version configurations and the new kafka version, using host-by-host patches and service restart of kafka broker, e.g. https://gerrit.wikimedia.org/r/c/operations/puppet/+/1273863
- kafka-main2006
- kafka-main2007
- kafka-main2008
- kafka-main2009
- kafka-main2010
- Change the inter broker protocol version to match the new kafka version
- Set hieradata/role/common/kafka/main.yaml:profile::kafka::broker::inter_broker_protocol_version: 3.7
- Perform a final rolling restart of the brokers
- kafka-main eqiad:
- Pin the inter broker protocol version on the brokers to hieradata/role/common/kafka/main.yaml:profile::kafka::broker::inter_broker_protocol_version: 1.1.0
- Perform a rolling upgrade of the brokers, that will restart with the pinned version configurations and the new kafka version, using host-by-host patches and service restart of kafka broker, e.g. https://gerrit.wikimedia.org/r/c/operations/puppet/+/1273863
- kafka-main1006
- kafka-main1007
- kafka-main1008
- kafka-main1009
- kafka-main1010
- Change the inter broker protocol version to match the new kafka version
- Set hieradata/role/common/kafka/main.yaml:profile::kafka::broker::inter_broker_protocol_version: 3.7
- Perform a final rolling restart of the brokers
Additional migrations/upgrades & post-upgrade cleanup: see T427088: [Post kafka-main 3.7 upgrade work] Reimage brokers to trixie/JDK21 & vlan migrations on select brokers
Cluster states:
kafka-main codfw:
| Kafka Broker | Confluent distribution 77 | Inter-broker protocol |
| kafka-main2006.codfw.wmnet | Upgraded ✅ | 3.7 ✅ |
| kafka-main2007.codfw.wmnet | Upgraded ✅ | 3.7 ✅ |
| kafka-main2008.codfw.wmnet | Upgraded ✅ | 3.7 ✅ |
| kafka-main2009.codfw.wmnet | Upgraded ✅ | 3.7 ✅ |
| kafka-main2010.codfw.wmnet | Upgraded ✅ | 3.7 ✅ |
kafka-main eqiad:
| Kafka Broker | Confluent distribution 77 | Inter-broker protocol |
| kafka-main1006.eqiad.wmnet | Upgraded ✅ | 3.7 ✅ |
| kafka-main1007.eqiad.wmnet | Upgraded ✅ | 3.7 ✅ |
| kafka-main1008.eqiad.wmnet | Upgraded ✅ | 3.7 ✅ |
| kafka-main1009.eqiad.wmnet | Upgraded ✅ | 3.7 ✅ |
| kafka-main1010.eqiad.wmnet | Upgraded ✅ | 3.7 ✅ |
[0] - https://wikitech.wikimedia.org/wiki/Kafka/Administration#Upgrade_to_Kafka_3.7