Labvirt1003 is misbehaving tonight -- ganglia can't reach it and I can't ssh in. Notably, labs VMs running there seem basically happy; I stopped one that was gobbling CPU to see if that would let me start ssh, to no avail.
I don't see any evidence that OOM killer has run. But, dmesg is full of this:
[1835838.047410] CPU17: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047413] CPU13: Core temperature above threshold, cpu clock throttled (total events = 10510)
[1835838.047414] CPU37: Core temperature above threshold, cpu clock throttled (total events = 10360)
[1835838.047417] CPU18: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047419] CPU19: Package temperature above threshold, cpu clock throttled (total events = 10815)
[1835838.047421] CPU22: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047722] CPU38: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047725] CPU47: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047728] CPU39: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047729] CPU15: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047732] CPU40: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047734] CPU43: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047736] CPU44: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047738] CPU16: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047739] CPU14: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.047741] CPU42: Package temperature above threshold, cpu clock throttled (total events = 10815)
[1835838.048039] CPU21: Package temperature above threshold, cpu clock throttled (total events = 10814)
[1835838.048040] CPU20: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.048041] CPU46: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.048043] CPU45: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.048045] CPU41: Package temperature above threshold, cpu clock throttled (total events = 10815)
[1835838.048048] CPU36: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.048049] CPU12: Package temperature above threshold, cpu clock throttled (total events = 10814)
[1835838.048051] CPU23: Package temperature above threshold, cpu clock throttled (total events = 10816)
[1835838.048052] CPU13: Package temperature above threshold, cpu clock throttled (total events = 10715)
[1835838.048053] CPU37: Package temperature above threshold, cpu clock throttled (total events = 10599)
[1835838.048444] CPU37: Core temperature/speed normal
[1835838.048445] CPU13: Core temperature/speed normal
[1835838.048446] CPU20: Package temperature/speed normal
[1835838.048448] CPU42: Package temperature/speed normal
[1835838.048449] CPU12: Package temperature/speed normal
[1835838.048449] CPU16: Package temperature/speed normal
[1835838.048451] CPU46: Package temperature/speed normal
[1835838.048452] CPU14: Package temperature/speed normal
[1835838.048453] CPU21: Package temperature/speed normal
[1835838.048453] CPU45: Package temperature/speed normal
[1835838.048454] CPU36: Package temperature/speed normal
[1835838.048455] CPU40: Package temperature/speed normal
[1835838.048457] CPU23: Package temperature/speed normal
[1835838.048458] CPU19: Package temperature/speed normal
[1835838.048459] CPU44: Package temperature/speed normal
[1835838.048460] CPU18: Package temperature/speed normal
[1835838.048461] CPU41: Package temperature/speed normal
[1835838.048462] CPU38: Package temperature/speed normal
[1835838.048463] CPU22: Package temperature/speed normal
[1835838.048464] CPU47: Package temperature/speed normal
[1835838.048465] CPU43: Package temperature/speed normal
[1835838.048467] CPU37: Package temperature/speed normal
[1835838.048467] CPU13: Package temperature/speed normal
[1835838.048469] CPU39: Package temperature/speed normal
[1835838.048471] CPU15: Package temperature/speed normal
[1835839.153874] CPU17: Package temperature/speed normal