In T172681 no varnishkafka alarm fired and we didn't get any notice of the error until somebody looked in grafana by chance.
I tried in T172681 to tune the alarms but I failed, so I rolled-back to the previous version with a lower critical threshold (5k instead of 20k).
This task should check the following:
- Is this enough? Should we change metrics/thresholds?
- Check if the new alarm does not create a storm of alerts when a Kafka broker is restarted.


