One follow up on https://wikitech.wikimedia.org/w/index.php?title=Incident_documentation/20180711-kafka-eqiad, is to verify that EventBus doesn't require any broker to be in sync with the current partition leader to consider an event enqueued.
Description
Description
Event Timeline
Comment Actions
This should be the case if I am reading the code correctly. Kafka Python by default works in this way (https://kafka-python.readthedocs.io/en/master/apidoc/KafkaProducer.html - acks=1). Going to add this info to the incident report.