
labstore1005 mgmt console unreachable via SSH
Closed, Resolved, Public

Description

While the host itself is up and SSH to it works fine, SSH to the mgmt console times out after a while. It has been alerting in Icinga for about an hour and a half.

Event Timeline

aborrero moved this task from Inbox to Soon! on the cloud-services-team (Kanban) board.

Mentioned in SAL (#wikimedia-operations) [2020-03-30T15:25:20Z] <jeh> add icinga 2h downtime and soft reset iDRAC on labstore1005.mgmt.eqiad.wmnet T247965

Seems fine after soft-resetting the iDRAC card with racadm racreset. If this happens again, we should look at upgrading the firmware, which may require a full restart of the host.
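If this recurs, a quick way to confirm the symptom before and after a reset is to check whether the mgmt interface accepts a TCP connection on the SSH port at all. The following is only a rough sketch: the function name ssh_port_reachable, the default port 22, and the 10-second timeout are assumptions for illustration, not part of any existing tooling, and a successful TCP connect does not prove that the SSH login itself works.

```python
import socket


def ssh_port_reachable(host, port=22, timeout=10.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper: it only tests the TCP handshake, not SSH itself.
    """
    try:
        # create_connection resolves the name and connects, raising OSError
        # (including timeouts and DNS failures) when it cannot.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Example usage (hostname taken from the SAL entry above):
#   ssh_port_reachable("labstore1005.mgmt.eqiad.wmnet")
```

A False result for the mgmt address while the host's own address returns True would match the situation described in this task.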