This morning I noticed in icinga the following alarm:
ocg1003;OCG health;WARNING;HARD;3;WARNING: connection error: HTTPConnectionPool(host='localhost', port=8000): Read timed out. (read timeout=5)
which results in a warning if the read times out or any connection error happens.
This should not only be critical, but also we should get paged when any ocg server is unreachable until T120077 is solved, given right now any malfunction of a single ocg server results in user-noticeable downtime.