Page MenuHomePhabricator

Neon sdb is failing
Closed, ResolvedPublic

Description

Logs below
Jun 12 08:08:10 neon kernel: [1950781.129229] sd 1:0:0:0: [sdb] Unhandled sense
code Jun 12 08:08:10 neon kernel: [1950781.129233] sd 1:0:0:0: [sdb] Result:
hostbyte=DID_OK driverbyte=DRIVER_SENSEJun 12 08:08:10 neon kernel:
[1950781.129239] sd 1:0:0:0: [sdb] Sense Key : Medium Error [current]
[descriptor]Jun 12 08:08:10 neon kernel: [1950781.129245] Descriptor sense data
with sense descriptors (in hex):Jun 12 08:08:10 neon kernel: [1950781.129249]
72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00 Jun 12 08:08:10 neon kernel:
[1950781.129262] 00 04 3b ba Jun 12 08:08:10 neon kernel: [1950781.129268] sd
1:0:0:0: [sdb] Add. Sense: Unrecovered read error - auto reallocate failedJun
12 08:08:10 neon kernel: [1950781.129275] sd 1:0:0:0: [sdb] CDB: Read(10): 28
00 00 04 3b b8 00 00 08 00Jun 12 08:08:10 neon kernel: [1950781.129287]
end_request: I/O error, dev sdb, sector 277434
md has failed over to sda for this and other sectors so no data loss has
occurred.

Details

Reference
rt5290

Event Timeline

rtimport raised the priority of this task from to Medium.Dec 18 2014, 1:37 AM
rtimport added a project: ops-eqiad.
rtimport set Reference to rt5290.

Status changed from 'new' to 'open' by cmjohnson

-- i don't see where the raid array is degraded
<root at neon:~# cat /proc/mdstat Personalities : [linear] [multipath] [raid0]
[raid1] [raid6] [raid5] [raid4] [raid10] md0 : active raid1 sdb1[1] sda1[0]
9756544 blocks super 1> [2/2] [UU]
Chris Johnson
Wikimedia Foundation, Inc

-- While the disk may be showing signs of a potential failure. I can not
request a replacement unless the disk fails. Resolving the ticket at this time.
Chris Johnson
Wikimedia Foundation, Inc

Status changed from 'open' to 'resolved' by cmjohnson

Dzahn changed the visibility from "WMF-NDA (Project)" to "Public (No Login Required)".Oct 1 2018, 10:41 PM
Dzahn changed the edit policy from "WMF-NDA (Project)" to "All Users".