In T133744#2381269, @BBlack wrote:@Yurik - T137617 does detailed service monitoring on each node (and probably shouldn't page people). What we're lacking is the higher-level "is this service alive?" check (which is usually just a simple request) pointed at http://kartotherian.svc.codfw.wmnet:6533 (and ditto for eqiad once it's alive), as well as the public-side checks on https://maps.wikimedia.org, both of which should alert/page on downtime.
Other services defined here