Page MenuHomePhabricator

Degraded RAID on analytics1057
Closed, ResolvedPublic

Description

TASK AUTO-GENERATED by Nagios/Icinga RAID event handler

A degraded RAID (megacli) was detected on host analytics1057. An automatic snapshot of the current RAID status is attached below.

Please sync with the service owner to find the appropriate time window before actually replacing any failed hardware.

CRITICAL: 1 failed LD(s) (Offline)

$ sudo /usr/local/lib/nagios/plugins/get_raid_status_megacli
=== RaidStatus (does not include components in optimal state)
name: Adapter #0

	Virtual Drive: 4 (Target Id: 4)
	RAID Level: Primary-0, Secondary-0, RAID Level Qualifier-0
	State: =====> Offline <=====
	Number Of Drives: 1
	Number of Spans: 1
	Current Cache Policy: WriteThrough, ReadAheadNone, Direct, No Write Cache if Bad BBU

		Span: 0 - Number of PDs: 1

			PD: 0 Information
			Enclosure Device ID: 32
			Slot Number: 3
			Drive's position: DiskGroup: 4, Span: 0, Arm: 0
			Media Error Count: 13
			Other Error Count: 1
			Predictive Failure Count: 0
			Last Predictive Failure Event Seq Number: 0

				Raw Size: 3.638 TB [0x1d1c0beb0 Sectors]
				Firmware state: =====> Failed <=====
				Media Type: Hard Disk Device
				Drive Temperature: 42C (107.60 F)

=== RaidStatus completed

Related Objects

Event Timeline

Dell now requires a new report. My self dispatch was kicked back. I resubmitted just now

Cmjohnson added a subscriber: elukey.

@elukey the disk has been replaced. you may need to add it back

Return tracking info
USPS 9202 3946 5301 2437 9877 74
FEDEX 8611918 2393026 74737795

Assigning to @elukey to add back and resolve task

Disk configured and Hadoop worker node back serving traffic, thanks!