Offline uncorrectable sectors on poolcounter1002 /dev/sda
Closed, Resolved (Public)

Description

This message was generated by the smartd daemon running on:

host name:  poolcounter1002
DNS domain: eqiad.wmnet

The following warning/error was logged by the smartd daemon:

Device: /dev/sda [SAT], 7 Offline uncorrectable sectors

Device info:
SAMSUNG HE502HJ, 500 GB

You can also use the smartctl utility for further investigation.
Another message will be sent in 24 hours if the problem persists.
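
For triage, the warning line in the notification above can be pulled apart programmatically. The following is only a minimal sketch: the function name `parse_offline_uncorrectable` and the regular expression are illustrative assumptions based solely on the single line quoted in this task, not on smartd's full range of message formats.

```python
import re

# Pattern modelled on the one warning line quoted in this task, e.g.
# "Device: /dev/sda [SAT], 7 Offline uncorrectable sectors".
# Assumption: other smartd messages may be worded differently.
SMARTD_LINE = re.compile(
    r"Device:\s+(?P<device>\S+)\s+\[(?P<dtype>[^\]]+)\],\s+"
    r"(?P<count>\d+)\s+Offline uncorrectable sectors"
)


def parse_offline_uncorrectable(line):
    """Return (device, device_type, sector_count), or None if the line does not match."""
    match = SMARTD_LINE.search(line)
    if match is None:
        return None
    return match.group("device"), match.group("dtype"), int(match.group("count"))


if __name__ == "__main__":
    warning = "Device: /dev/sda [SAT], 7 Offline uncorrectable sectors"
    print(parse_offline_uncorrectable(warning))
```

A non-zero count for the same device across successive daily messages would suggest the problem is persisting, which is the case the notification says it will keep reporting.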

Event Timeline

I have several 500GB disks lying around. Let me know when this server is depooled and powered off. It will most likely need a re-install.

Change 404967 had a related patch set uploaded (by Filippo Giunchedi; owner: Muehlenhoff):
[operations/mediawiki-config@master] Depool poolcounter1002

https://gerrit.wikimedia.org/r/404967

@Cmjohnson ok! I'll merge https://gerrit.wikimedia.org/r/404967 early next week to depool the machine and let you know.

RobH triaged this task as Medium priority.
RobH subscribed.

Setting to normal priority and assigning to @fgiunchedi until he merges his change and depools the system for maintenance. When it's ready, please assign it back to @Cmjohnson so he knows to proceed.

Thanks!

Change 404967 merged by jenkins-bot:
[operations/mediawiki-config@master] Depool poolcounter1002

https://gerrit.wikimedia.org/r/404967

Machine isn't in service now, @Cmjohnson all yours

@fgiunchedi This server needs to be replaced and decommissioned. It is on the 5+ years old list. Please create a hardware request ticket to either assign a spare server or purchase a replacement, and assign it to @RobH.

The disk has been replaced, so you can either try to add it back or re-install. Historically, there have been issues bringing a server back after replacing /dev/sda.