We have an alert firing at the moment:
The number of events shows as lagging for each partition of the eventlogging_processor_client_side_00 topic has just jumped from around 36K to around 93K.
(EventLoggingKafkaLag) firing: Kafka consumer lag for event logging over threshold for past 15 min.
Runbook: https://wikitech.wikimedia.org/wiki/Analytics/Systems/EventLogging/Administration
Graphs: https://grafana.wikimedia.org/d/000000484/kafka-consumer-lag?orgId=1&prometheus=ops&var-cluster=jumbo-eqiad&var-topic=All&var-consumer_group=eventlogging_processor_client_side_00
Alert: https://alerts.wikimedia.org/?q=alertname%3DEventLoggingKafkaLag