3 pages had happened in the last 2 days (alerts text: Socket timeout after 10 seconds):
- 2020-07-21 15:04:09 UTC 2020 (aprox)
- [2020-07-22 10:39:55] SERVICE ALERT: api.svc.codfw.wmnet;LVS api codfw port 80/tcp - MediaWiki API cluster- api.svc.eqiad.wmnet IPv4 #page;CRITICAL;HARD;3;CRITICAL - Socket timeout after 10 seconds
- [2020-07-22 16:28:38] SERVICE ALERT: api.svc.codfw.wmnet;LVS api codfw port 80/tcp - MediaWiki API cluster- api.svc.eqiad.wmnet IPv4 #page;CRITICAL;HARD;3;CRITICAL - Socket timeout after 10 seconds
The times are approximate (when alerts trigger) the queries fail at least twice before paging, and multiple times in SOFT state (once or twice) over the last 2 days.
There is not a strightforward reason why this is happening.
Interestingly, they seem to fail for icinga1001 and icinga2001 at different times (but are detected from both hosts).