Received from cron:
This message was generated by the smartd daemon running on:

   host name:  kafka1012
   DNS domain: eqiad.wmnet

The following warning/error was logged by the smartd daemon:

Device: /dev/sdh, SMART Failure: FAILURE PREDICTION THRESHOLD EXCEEDED: ascq=0x5

Device info:
[SEAGATE ST32000444SS KS68], lu id: 0x5000c50025fcf02f, S/N: 9WM3F298, 2.00 TB
Double-checked on the host; /dev/sdh1 is the only device reporting a non-empty grown defect list:
elukey@kafka1012:~$ for el in `df -h | grep spool | cut -d " " -f 1`; do echo $el; sudo smartctl -a $el | grep defect; done
/dev/sdg1
Elements in grown defect list: 0
/dev/sdj1
Elements in grown defect list: 0
/dev/sdb1
Elements in grown defect list: 0
/dev/sdi1
Elements in grown defect list: 0
/dev/sdk1
Elements in grown defect list: 0
/dev/sdl1
Elements in grown defect list: 0
/dev/sdf1
Elements in grown defect list: 0
/dev/sdd1
Elements in grown defect list: 0
/dev/sde1
Elements in grown defect list: 0
/dev/sda3
Elements in grown defect list: 0
/dev/sdc3
Elements in grown defect list: 0
/dev/sdh1
Elements in grown defect list: 4061
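For repeat checks, the same test can be wrapped so that only devices with a non-zero count are printed. This is a minimal sketch, not something that was run on the host: the function names `grown_defects` and `report_defects` are hypothetical, and it assumes `smartctl -a` prints the "Elements in grown defect list" line as in the output above (which is what these SAS drives report; other drive types may not).

```shell
# Sketch only: grown_defects and report_defects are hypothetical helper names.

# Read `smartctl -a` output on stdin and print the grown defect count.
# Prints nothing if the line is absent from the input.
grown_defects() {
    awk '/Elements in grown defect list/ { print $NF; exit }'
}

# Print "<device> <count>" for each given device whose count is above zero.
# Assumes smartctl is installed and the caller can use sudo.
report_defects() {
    local dev count
    for dev in "$@"; do
        count=$(sudo smartctl -a "$dev" | grown_defects)
        if [ -n "$count" ] && [ "$count" -gt 0 ]; then
            echo "$dev $count"
        fi
    done
}

# Example invocation (device names taken from the output above):
#   report_defects /dev/sdg1 /dev/sdh1
```

Splitting the parsing from the `smartctl` call keeps the parsing step checkable against saved output without needing root or the actual disks.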
The host is scheduled to be decommissioned within the next two quarters, but I'd prefer to swap the disk in advance to avoid any service disruption.