Page MenuHomePhabricator

cloudvirt1014 crash
Closed, DuplicatePublic

Description

Fri 27 Dec 2019 05:10:13 -- I just got a page about cloudvirt1014 being down, and restarted it from the mgmt console.

Event Timeline

Surely this is related to T241313, although cloudvirt1013 and 1014 are in different racks

cloudvirt1013, cloudvirt1014, and cloudvirt1023 are the only cloudvirts running

Linux 4.9.0-11-amd64 #1 SMP Debian 4.9.189-3+deb9u2 (2019-11-11)

cloudvirt1023 is held back as a spare, so not under load.

Kernel is probably unrelated, they're running that new kernel because of the post-crash reboot, were running the standard kernel before that.

Note for DCOps: This still has VMs live. Please coordinate with WMCS before shutting down for troubleshooting.

aborrero subscribed.

I believe this is the same as T241494: Degraded RAID on cloudvirt1014. BBU replacement. Closing as duplicated.