
Broken disk on thanos-be1003
Closed, Resolved, Public

Description

thanos-be1003 has a disk error flagging in Icinga:

DISK CRITICAL - /srv/swift-storage/sdn1 is not accessible: Input/output error

The host is slightly out of warranty, but maybe we have a spare disk?

Event Timeline

MoritzMuehlenhoff triaged this task as Medium priority.
Jclark-ctr subscribed.

Replaced the failed HDD with a spare HDD

2023-08-29 18:36:50 PDR3 Disk 11 in Backplane 1 of Integrated RAID Controller 1 is not functioning correctly. Part Number = TH0XPJ47SGT0003K01PDA00

Imported the foreign configuration into the RAID controller

Mentioned in SAL (#wikimedia-operations) [2023-08-30T08:51:10Z] <Emperor> stopping puppet to fix broken drive labelling after disk swap thanos-be1003 T345079

Puppet had detected the new drive as /dev/sdm, so it created a new filesystem labelled swift-sdm1 on it, even though a filesystem with that label was already mounted and in use. I fixed this up by hand, and the new drive is now serving as swift-sdn1.
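A check for this kind of label collision after a disk swap could look something like the sketch below. It is not what was run on thanos-be1003. It assumes the key="value" output format of `lsblk -P -o NAME,LABEL`, and the helper names (`parse_lsblk_pairs`, `find_duplicate_labels`) and the sample device list are hypothetical, chosen only for illustration.

```python
import re
from collections import defaultdict

# Matches KEY="value" pairs as printed by `lsblk -P` (assumed input format).
PAIR_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_lsblk_pairs(text):
    """Return {device_name: label} for every device that has a label."""
    devices = {}
    for line in text.splitlines():
        fields = dict(PAIR_RE.findall(line))
        name = fields.get("NAME")
        label = fields.get("LABEL", "")
        if name and label:  # skip unlabelled devices and blank lines
            devices[name] = label
    return devices


def find_duplicate_labels(devices):
    """Return {label: [device names]} for labels used by more than one device."""
    by_label = defaultdict(list)
    for name, label in devices.items():
        by_label[label].append(name)
    return {
        label: sorted(names)
        for label, names in by_label.items()
        if len(names) > 1
    }


# Hypothetical sample, NOT captured from the real host.
SAMPLE = '''NAME="sdl1" LABEL="swift-sdl1"
NAME="sdm1" LABEL="swift-sdm1"
NAME="sdn1" LABEL="swift-sdm1"
NAME="sdo" LABEL=""
'''

if __name__ == "__main__":
    for label, names in find_duplicate_labels(parse_lsblk_pairs(SAMPLE)).items():
        print(f"label {label} is used by: {', '.join(names)}")
```

On a real host the text would come from running `lsblk -P -o NAME,LABEL` rather than from the sample string. Any label reported here would need manual review before puppet is re-enabled.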