OfflineUncorrectableSector on mw1256 sda
Closed, Resolved, Public

Description

This message was generated by the smartd daemon running on:

host name:  mw1256
DNS domain: eqiad.wmnet

The following warning/error was logged by the smartd daemon:

Device: /dev/sda [SAT], 1 Offline uncorrectable sectors

Device info:
WDC WD5003ABYX-18WERA0, 500 GB

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
Another message will be sent in 24 hours if the problem persists.
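
As a follow-up to the smartctl pointer above, here is a minimal sketch of pulling the device name and sector count out of the warning line and building a smartctl invocation for it. The helper names (`parse_smartd_warning`, `smartctl_command`) are hypothetical and not part of smartmontools. The pattern assumes the warning line has the same shape as the one in this message, and the command is only constructed, not run, since smartctl normally needs root on the affected host.

```python
import re

# Pattern for the warning line format seen in this smartd message:
#   Device: /dev/sda [SAT], 1 Offline uncorrectable sectors
# Assumption: other smartd warnings may be worded differently.
SMARTD_LINE = re.compile(
    r"Device:\s+(?P<device>\S+)\s+\[(?P<type>[^\]]+)\],\s+"
    r"(?P<count>\d+)\s+(?P<problem>.+)"
)


def parse_smartd_warning(line):
    """Return the fields of a smartd warning line, or None if it does not match."""
    m = SMARTD_LINE.search(line)
    if m is None:
        return None
    return {
        "device": m.group("device"),
        "type": m.group("type"),
        "count": int(m.group("count")),
        "problem": m.group("problem").strip(),
    }


def smartctl_command(device):
    """Build (but do not run) a smartctl call for the given device.

    -H prints the overall health assessment, -A prints the vendor
    attribute table, which is where offline uncorrectable sectors
    are usually reported.
    """
    return ["smartctl", "-H", "-A", device]


if __name__ == "__main__":
    warning = parse_smartd_warning(
        "Device: /dev/sda [SAT], 1 Offline uncorrectable sectors"
    )
    print(warning)
    print(" ".join(smartctl_command(warning["device"])))
```

The resulting argument list could be passed to `subprocess.run` on the host itself. It is kept as a plain list here so the sketch has no side effects.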

Event Timeline

@Joe The disk is out of warranty, but I have a kajillion 500GB disks lying around. The disk can be replaced anytime, but a re-install will be needed. Let me know once the server is depooled and powered off.

Mentioned in SAL (#wikimedia-operations) [2018-02-08T07:27:33Z] <_joe_> depooled mw1256 from traffic, scap (faulty disk, T186535); now powering it off

RobH triaged this task as Medium priority.
RobH subscribed.

SAL states it's depooled, and it looks like it's ready to go: powered down.

The disk has been replaced and the server needs a re-install.

I tried to access mw1256.mgmt.eqiad.wmnet at 10.65.2.106, but that ends up being lead's console:

Debian GNU/Linux 8 lead ttyS1

lead login:

Mentioned in SAL (#wikimedia-operations) [2018-02-13T17:54:24Z] <godog> repool mw1256 after disk swap - T186535

This has been completed. Resolving.