
Degraded RAID on thanos-be1003
Closed, Resolved (Public)

Description

TASK AUTO-GENERATED by Nagios/Icinga RAID event handler

A degraded RAID (megacli) was detected on host thanos-be1003. An automatic snapshot of the current RAID status is attached below.

Please sync with the service owner to find the appropriate time window before actually replacing any failed hardware.

CRITICAL: 1 failed LD(s) (Offline)

$ sudo /usr/local/lib/nagios/plugins/get-raid-status-megacli
=== RaidStatus (does not include components in optimal state)
name: Adapter #0

	Virtual Drive: 12 (Target Id: 12)
	RAID Level: Primary-0, Secondary-0, RAID Level Qualifier-0
	State: =====> Offline <=====
	Number Of Drives: 1
	Number of Spans: 1
	Current Cache Policy: WriteBack, ReadAhead, Direct, Write Cache OK if Bad BBU

		Span: 0 - Number of PDs: 1

			PD: 0 Information
			Enclosure Device ID: 32
			Slot Number: 10
			Drive's position: DiskGroup: 11, Span: 0, Arm: 0
			Media Error Count: 7
			Other Error Count: 0
			Predictive Failure Count: 0
			Last Predictive Failure Event Seq Number: 0

				Raw Size: 3.638 TB [0x1d1c0beb0 Sectors]
				Firmware state: =====> Offline <=====
				Media Type: Hard Disk Device
				Drive Temperature: 24C (75.20 F)

=== RaidStatus completed
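
For anyone triaging a report like the one above by script, the following is a minimal sketch of pulling the failed drive's location out of the text. It assumes the field labels appear exactly as in this snapshot ("Enclosure Device ID", "Slot Number", "Media Error Count", "Firmware state"); the function name `find_offline_drives` and the `SAMPLE` constant are hypothetical and are not part of the Nagios plugin.

```python
import re

# Excerpt of the report from this task, used as sample input.
SAMPLE = """\
Virtual Drive: 12 (Target Id: 12)
State: =====> Offline <=====
PD: 0 Information
Enclosure Device ID: 32
Slot Number: 10
Drive's position: DiskGroup: 11, Span: 0, Arm: 0
Media Error Count: 7
Firmware state: =====> Offline <=====
"""


def find_offline_drives(report: str) -> list:
    """Return one dict per physical drive whose firmware state is not Online."""
    drives = []
    current = {}
    for raw in report.splitlines():
        line = raw.strip()
        if line.startswith("Enclosure Device ID:"):
            # A new physical-drive block starts here.
            current = {"enclosure": int(line.split(":", 1)[1])}
        elif line.startswith("Slot Number:"):
            current["slot"] = int(line.split(":", 1)[1])
        elif line.startswith("Media Error Count:"):
            current["media_errors"] = int(line.split(":", 1)[1])
        elif line.startswith("Firmware state:"):
            # Strip the "=====>" / "<=====" highlighting added by the plugin.
            state = re.sub(r"[=<>]", "", line.split(":", 1)[1]).strip()
            current["state"] = state
            if not state.startswith("Online"):
                drives.append(current)
            current = {}
    return drives


if __name__ == "__main__":
    for drive in find_offline_drives(SAMPLE):
        print(drive)
```

The enclosure and slot it returns (32 and 10 here) identify the physical disk to pull, which is the information the replacement request needs.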

Event Timeline

herron triaged this task as High priority. Mar 28 2022, 6:26 PM

Disk has been ordered

You have successfully submitted request SR1089086900.

The disk has been replaced and is back online.

cmjohnson@thanos-be1003:~$ sudo megacli -CfgEachDskRaid0 WB RA Direct CachedBadBBU -a0

Adapter 0: Created VD 12
Configured physical device at Encl-32:Slot-10.

1 physical devices are Configured on adapter 0.
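
The flags passed to `-CfgEachDskRaid0` appear to correspond one-to-one with the "Current Cache Policy" line in the original status snapshot. The sketch below checks that correspondence. The mapping table is an assumption inferred from the two lines in this task, not taken from MegaCLI documentation, and the helper names are hypothetical.

```python
# Assumed mapping from the short flags used on the command line to the
# wording printed in the "Current Cache Policy" line of the status report.
FLAG_TO_POLICY = {
    "WB": "WriteBack",
    "RA": "ReadAhead",
    "Direct": "Direct",
    "CachedBadBBU": "Write Cache OK if Bad BBU",
}


def expected_cache_policy(flags):
    """Build the policy string the status report should show for these flags."""
    return ", ".join(FLAG_TO_POLICY[flag] for flag in flags)


def policy_matches(flags, status_line):
    """Compare a 'Current Cache Policy: ...' line against the flags used."""
    reported = status_line.split(":", 1)[1].strip()
    return reported == expected_cache_policy(flags)


if __name__ == "__main__":
    flags = ["WB", "RA", "Direct", "CachedBadBBU"]
    line = ("Current Cache Policy: WriteBack, ReadAhead, Direct, "
            "Write Cache OK if Bad BBU")
    print(policy_matches(flags, line))
```

A check like this can confirm that a recreated virtual drive carries the same cache policy as the one it replaced.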