We noticed that when the statsd server name changed to cloudmetrics1001, this service failed, which prevented grid-based webservices from being restarted. Since the metrics servers are a pair, this seems like a bad design.
We also didn't notice the problem for some time, which suggests these are not high-value metrics.