Page MenuHomePhabricator

tin has a failing hdd
Closed, ResolvedPublic

Description

Tin, the eqiad deployment server, has reported a bad disk:

This message was generated by the smartd daemon running on:

host name:  tin
DNS domain: eqiad.wmnet

The following warning/error was logged by the smartd daemon:

Device: /dev/bus/0 [megaraid_disk_00] [SAT], 1 Currently unreadable (pending) sectors

Device info:
WDC WD5003ABYX-18WERA0, S/N:WD-WMAYP4346290, WWN:5-0014ee-003577b59, FW:01.01S02, 500 GB

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
The original message about this issue was sent at Mon Aug 28 13:18:29 2017 UTC
Another message will be sent in 24 hours if the problem persists.

This system has cabled (not hot swap) disk bays, and would require downtime to swap. Additionally, it was purchased on 2012-08-29, and is well out of warranty.

There will be a hardware-requests task made and linked to this, for the replacement of tin entirely. If it is denied, then this task will become active and the disk replacement will need to take place. If the hw request is approved, I'll steal this task back and invalidate it, since tin will then be slated for decom.