Page MenuHomePhabricator

Degraded RAID on ms-be2025
Closed, InvalidPublic

Description

TASK AUTO-GENERATED by Nagios/Icinga RAID event handler

A degraded RAID (md) was detected on host ms-be2025. An automatic snapshot of the current RAID status is attached below.

Please sync with the service owner to find the appropriate time window before actually replacing any failed hardware.

CRITICAL: State: degraded, Active: 3, Working: 3, Failed: 1, Spare: 0

$ sudo /usr/local/lib/nagios/plugins/get-raid-status-md
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] [raid4] [raid10] 
md0 : active raid1 sdb1[1](F) sda1[0]
      58559488 blocks super 1.2 [2/1] [U_]
      
md1 : active (auto-read-only) raid1 sda2[0] sdb2[1]
      976320 blocks super 1.2 [2/2] [UU]
      
unused devices: <none>

Event Timeline

Mentioned in SAL (#wikimedia-operations) [2020-07-06T14:28:58Z] <godog> powercycle ms-be2025, no ssh available - T257214

Mentioned in SAL (#wikimedia-operations) [2020-07-06T14:36:38Z] <godog> reboot ms-be2025 for hw raid software upgrade - T257214

fgiunchedi added a subscriber: fgiunchedi.

Host came back clean, I've updated the hw raid firmware while I was at it