We have recently got some issues with Mirror Maker that seemed to be related to the switch of Monolog from Kafka Analytics to Jumbo.
Mirror Maker seems to have some issues when producing/replicating topics from kafka main-eqiad to jumbo. This is a snippet of logs from kafka1020:
Mar 12 11:00:49 kafka1020 kafka-mirror-maker[31938]: [2018-03-12 11:00:49,500] ERROR Error when sending message to topic codfw.change-prop.transcludes.resource-change with key: null Mar 12 11:00:49 kafka1020 kafka-mirror-maker[31938]: [2018-03-12 11:00:49,500] ERROR Error when sending message to topic codfw.change-prop.transcludes.resource-change with key: null Mar 12 11:00:49 kafka1020 kafka-mirror-maker[31938]: [2018-03-12 11:00:49,501] ERROR Error when sending message to topic codfw.change-prop.transcludes.resource-change with key: null Mar 12 11:00:49 kafka1020 kafka-mirror-maker[31938]: [2018-03-12 11:00:49,501] ERROR Error when sending message to topic codfw.change-prop.transcludes.resource-change with key: null Mar 12 11:00:50 kafka1020 kafka-mirror-maker[31938]: where num_of_file > 0 Mar 12 11:00:50 kafka1020 kafka-mirror-maker[31938]: GC log rotation is turned off Mar 12 11:00:50 kafka1020 systemd[1]: kafka-mirror-main-eqiad_to_jumbo-eqiad.service: main process exited, code=exited, status=255/n/a Mar 12 11:00:50 kafka1020 systemd[1]: Unit kafka-mirror-main-eqiad_to_jumbo-eqiad.service entered failed state. Mar 12 11:00:53 kafka1020 systemd[1]: kafka-mirror-main-eqiad_to_jumbo-eqiad.service holdoff time over, scheduling restart. Mar 12 11:00:53 kafka1020 systemd[1]: Stopping Kafka MirrorMaker Instance of main-eqiad_to_jumbo-eqiad... Mar 12 11:00:53 kafka1020 systemd[1]: Starting Kafka MirrorMaker Instance of main-eqiad_to_jumbo-eqiad... Mar 12 11:00:53 kafka1020 systemd[1]: Started Kafka MirrorMaker Instance of main-eqiad_to_jumbo-eqiad. Mar 12 11:01:25 kafka1020 kafka-mirror-maker[34099]: Exception in thread "mirrormaker-thread-0" kafka.common.ConsumerRebalanceFailedException: kafka-mirror-main-eqiad_to_jumbo-eqiad Mar 12 11:01:25 kafka1020 kafka-mirror-maker[34099]: at kafka.consumer.ZookeeperConsumerConnector$ZKRebalancerListener.syncedRebalance(ZookeeperConsumerConnector.scala:660) Mar 12 11:01:25 kafka1020 kafka-mirror-maker[34099]: at kafka.consumer.ZookeeperConsumerConnector.kafka$consumer$ZookeeperConsumerConnector$$reinitializeConsumer(ZookeeperConsumerCo Mar 12 11:01:25 kafka1020 kafka-mirror-maker[34099]: at kafka.consumer.ZookeeperConsumerConnector$WildcardStreamsHandler.<init>(ZookeeperConsumerConnector.scala:1001) Mar 12 11:01:25 kafka1020 kafka-mirror-maker[34099]: at kafka.consumer.ZookeeperConsumerConnector.createMessageStreamsByFilter(ZookeeperConsumerConnector.scala:163) Mar 12 11:01:25 kafka1020 kafka-mirror-maker[34099]: at kafka.tools.MirrorMaker$MirrorMakerOldConsumer.init(MirrorMaker.scala:477) Mar 12 11:01:25 kafka1020 kafka-mirror-maker[34099]: at kafka.tools.MirrorMaker$MirrorMakerThread.run(MirrorMaker.scala:388) Mar 12 11:25:34 kafka1020 kafka-mirror-maker[34099]: [2018-03-12 11:25:34,881] WARN No broker partitions consumed by consumer thread kafka-mirror-main-eqiad_to_jumbo-eqiad_kafka1020 Mar 12 11:25:34 kafka1020 kafka-mirror-maker[34099]: [2018-03-12 11:25:34,881] WARN No broker partitions consumed by consumer thread kafka-mirror-main-eqiad_to_jumbo-eqiad_kafka1022 Mar 12 11:25:34 kafka1020 kafka-mirror-maker[34099]: [2018-03-12 11:25:34,881] WARN No broker partitions consumed by consumer thread kafka-mirror-main-eqiad_to_jumbo-eqiad_kafka1023
Yesterday (2018-03-11) at around 20:00 UTC mirror maker stopped producing to Jumbo. I've restarted mirror maker on kafka1020, it seemed to work but we ended up in the same situation. Restarting it now again to see if things improve (11:31 UTC, 2018-03-12).