See also T90542
We currently have no monitoring for the availability of exec nodes. As a result, we are not notified when a queue is overwhelmed and, for example, no more webservice processes can start.
See also https://wikitech.wikimedia.org/wiki/Incident_documentation/20150817-ToolLabs-WebgridOutage
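To make the request concrete, here is a rough sketch of what such a check could look like. It is only an illustration, not an existing check: the `QueueSlots` type, the `check_queue` function, the queue name, and the thresholds are all hypothetical, and the slot counts are assumed to come from the grid scheduler by some means not shown here. The return codes follow the usual Nagios/Icinga plugin convention (0 = OK, 1 = WARNING, 2 = CRITICAL).

```python
from dataclasses import dataclass

# Conventional Nagios/Icinga plugin exit codes.
OK, WARNING, CRITICAL = 0, 1, 2


@dataclass
class QueueSlots:
    """Hypothetical snapshot of one queue's slot usage."""
    name: str
    used: int
    total: int


def check_queue(queue, warn_free=0.20, crit_free=0.05):
    """Return (status, message) based on the fraction of free slots.

    The thresholds are placeholder assumptions, not tuned values.
    """
    if queue.total <= 0:
        # No slots at all: every exec node for this queue is unavailable.
        return CRITICAL, f"{queue.name}: no exec node slots available"

    free = (queue.total - queue.used) / queue.total
    message = f"{queue.name}: {queue.used}/{queue.total} slots used"

    if free <= crit_free:
        return CRITICAL, message
    if free <= warn_free:
        return WARNING, message
    return OK, message


if __name__ == "__main__":
    # Example values only; a real check would query the scheduler.
    status, message = check_queue(QueueSlots("webgrid", used=98, total=100))
    print(message)
    raise SystemExit(status)
```

A check along these lines would alert before a queue is completely full, so that new webservice jobs failing to start does not go unnoticed.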