This week four different app servers have OOM'd, and the OOM killer has killed random processes:
mw1134
mw1132
mw1139
mw1138
I have rebooted all but mw1138, which I've depooled and left intact for investigation.
@Andrew mw1138 is not depooled (anymore); its CPU and network graphs show it is serving traffic. Looking at http://ganglia.wikimedia.org/latest/graph.php?r=day&z=xlarge&h=mw1132.eqiad.wmnet&m=cpu_report&s=by+name&mc=2&g=network_report&c=API+application+servers+eqiad it was idle for a few hours, but then it somehow got repooled.