Page MenuHomePhabricator

labservices1001/1002 sometimes unresponsive
Closed, DeclinedPublic

Description

Twice today icinga has complained about not being able to reach labservices1001. The first time this happened there were similar alerts for labservices1002.

The host is very slow to respond to ssh and/or sudo. Also, appending to the syslog seems to have stopped working.

Event Timeline

Change 500880 had a related patch set uploaded (by Andrew Bogott; owner: Andrew Bogott):
[operations/puppet@production] pdns-recursor: reduce maximum number of file descriptors

https://gerrit.wikimedia.org/r/500880

Change 500880 merged by Andrew Bogott:
[operations/puppet@production] pdns-recursor: reduce maximum number of file descriptors

https://gerrit.wikimedia.org/r/500880

I think this is moot since we're shutting down these systems. T221857