Page MenuHomePhabricator

deployment-kafka-logging01 is down for maintenance because Trixie is not yet well supported
Closed, ResolvedPublic

Description

I tried to re-create deployment-kafka-logging01 in deployment-prep with Debian Trixie and Kafka 3.7 to test the new versions, but for some reason puppet doesn't work. I noticed that puppet is not installed correctly, since we don't support 8:

elukey@deployment-kafka-logging01:~$ dpkg -l | grep puppet
ii  puppet-agent                            8.10.0-5                             all          configuration management system, agent

At the moment the Beta logs are down, I'll prioritize this on Monday. Sorry for the inconvenience!

Event Timeline

elukey triaged this task as High priority.
bd808 renamed this task from deployment-kafka-logging01 is down for maintenance to deployment-kafka-logging01 is down for maintenance because Trixie is not yet well supported.Mar 13 2026, 5:21 PM
bd808 moved this task from To Triage to Puppet errors on the Beta-Cluster-Infrastructure board.

Change #1251539 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] P:kafka::broker::monitoring: Fix legacy facts

https://gerrit.wikimedia.org/r/1251539

Change #1251540 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] confluent: kafka::broker: Fix legacy facts

https://gerrit.wikimedia.org/r/1251540

elukey@deployment-kafka-logging01:~$ apt-cache policy puppet-agent
puppet-agent:
  Installed: 8.10.0-5
  Candidate: 7.23.0-1+wmf13u1
  Version table:
 *** 8.10.0-5 500
        500 http://mirrors.wikimedia.org/debian trixie/main amd64 Packages
        100 /var/lib/dpkg/status
     7.23.0-1+wmf13u1 1002
       1001 http://apt.wikimedia.org/wikimedia trixie-wikimedia/component/puppet7 amd64 Packages

I just forced an apt-get install puppet-agent and the right one was installed, so it may be a race condition? Anyway, now the issue is:

Error: Could not retrieve catalog from remote server: Error 500 on SERVER: Server Error: Evaluation Error: Error while evaluating a Resource Statement, Evaluation Error: Error while evaluating a Function Call, Could not find template 'java/java-21.security.erb' (file: /srv/puppet_code/environments/production/modules/java/manifests/package.pp, line: 50, column: 24) (file: /srv/puppet_code/environments/production/modules/java/manifests/init.pp, line: 21) on node deployment-kafka-logging01.deployment-prep.eqiad1.wikimedia.cloud

That makes sense - we don't have a java-21 security config yet in puppet. I am going to create a task to do it, I'll chat with Moritz about it as well.

Next steps:

  • Add the new java-21 security config to puppet - T420083
  • Rollout @taavi's patches to fix facts in kafka classes to be ready for Puppet 8.
  • Fix remaining issues with Kafka on Trixie/Java 21

After that we should be able to have a brand new stack back :)

Kafka 3.7 is running on deployment-kafka-logging01 with Debian Trixie, first one of its kind!

@colewhite I see logstash consumer groups connecting, when you have a moment could you verify if everything works for beta logs?

@colewhite I see logstash consumer groups connecting, when you have a moment could you verify if everything works for beta logs?

I see logs coming in again. Thank you!

elukey claimed this task.

Change #1251539 merged by Elukey:

[operations/puppet@production] P:kafka::broker::monitoring: Fix legacy facts

https://gerrit.wikimedia.org/r/1251539

Change #1251540 merged by Elukey:

[operations/puppet@production] confluent: kafka::broker: Fix legacy facts

https://gerrit.wikimedia.org/r/1251540