
SMART alerts on db1069
Closed, Duplicate (Public)

Description

db1069 raised SMART alerts:

Enclosure Device ID: 32
Slot Number: 2
Drive's position: DiskGroup: 0, Span: 1, Arm: 0
Enclosure position: 1
Device Id: 2
WWN: 5000C50071B22C58
Sequence Number: 2
Media Error Count: 5
Other Error Count: 6
Predictive Failure Count: 1
Last Predictive Failure Event Seq Number: 39039
PD Type: SAS

Raw Size: 558.911 GB [0x45dd2fb0 Sectors]
Non Coerced Size: 558.411 GB [0x45cd2fb0 Sectors]
Coerced Size: 558.375 GB [0x45cc0000 Sectors]
Sector Size:  0
Firmware state: Online, Spun Up
Device Firmware Level: ES66
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x5000c50071b22c59
SAS Address(1): 0x0
Connected Port Number: 0(path0)
Inquiry Data: SEAGATE ST3600057SS     ES666SL82TJY
FDE Capable: Not Capable
FDE Enable: Disable
Secured: Unsecured
Locked: Unlocked
Needs EKM Attention: No
Foreign State: None
Device Speed: 6.0Gb/s
Link Speed: 6.0Gb/s
Media Type: Hard Disk Device
Drive Temperature :33C (91.40 F)
PI Eligibility:  No
Drive is formatted for PI information:  No
PI: No PI
Port-0 :
Port status: Active
Port's Linkspeed: 6.0Gb/s
Port-1 :
Port status: Active
Port's Linkspeed: Unknown
Drive has flagged a S.M.A.R.T alert : Yes
=== START OF INFORMATION SECTION ===
Vendor:               SEAGATE
Product:              ST3600057SS
Revision:             ES66
User Capacity:        600,127,266,816 bytes [600 GB]
Logical block size:   512 bytes
Rotation Rate:        15000 rpm
Form Factor:          3.5 inches
Logical Unit id:      0x5000c50071b22c5b
Serial number:        6SL82TJY
Device type:          disk
Transport protocol:   SAS (SPL-3)
Local Time is:        Sat May  4 06:10:26 2019 UTC
SMART support is:     Available - device has SMART capability.
SMART support is:     Enabled
Temperature Warning:  Disabled or Not Supported

=== START OF READ SMART DATA SECTION ===
SMART Health Status: FAILURE PREDICTION THRESHOLD EXCEEDED: ascq=0x5 [asc=5d, ascq=5]

Current Drive Temperature:     33 C
Drive Trip Temperature:        68 C

Elements in grown defect list: 2050

Vendor (Seagate) cache information
  Blocks sent to initiator = 1646500645
  Blocks received from initiator = 230321087
  Blocks read from cache and sent to initiator = 3672442191
  Number of read and write commands whose size <= segment size = 3272514569
  Number of read and write commands whose size > segment size = 26

Vendor (Seagate/Hitachi) factory information
  number of hours powered up = 43903.07
  number of minutes until next internal SMART test = 18

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:   2502743804        5         0  2502743809   2502743809    1115969.441           0
write:         0        0         0         0          0     370199.940           0
verify: 1638460598     3164         0  1638463762   1638463765     153830.245           7

Non-medium error count:      235

SMART Self-test log
Num  Test              Status                 segment  LifeTime  LBA_first_err [SK ASC ASQ]
     Description                              number   (hours)
# 1  Background short  Completed                  32       2                 - [-   -    -]
# 2  Background long   Completed                  32       2                 - [-   -    -]
# 3  Background short  Completed                  32       1                 - [-   -    -]

Long (extended) Self Test duration: 6400 seconds [106.7 minutes]
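
For anyone triaging similar output, here is a minimal Python sketch of how the fields that matter for this alert could be pulled out of the controller listing above. The function names (`parse_fields`, `summarize`) are hypothetical and not part of any existing tooling. The 512-byte sector size is an assumption taken from the "Logical block size" line in the smartctl output, since the controller itself reports "Sector Size: 0".

```python
import re

# Excerpt of the controller output pasted above (only the fields used here).
SAMPLE = """\
Slot Number: 2
Media Error Count: 5
Other Error Count: 6
Predictive Failure Count: 1
Raw Size: 558.911 GB [0x45dd2fb0 Sectors]
Firmware state: Online, Spun Up
Drive has flagged a S.M.A.R.T alert : Yes
"""

# Assumption: 512-byte sectors, per smartctl's "Logical block size" line.
SECTOR_BYTES = 512


def parse_fields(text):
    """Split 'Key: Value' lines into a dict; lines without a colon are skipped."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def summarize(fields):
    """Reduce the parsed fields to the values relevant to a SMART alert."""
    # The sector count is the hex number inside the square brackets.
    match = re.search(r"\[0x([0-9a-fA-F]+) Sectors\]", fields.get("Raw Size", ""))
    raw_bytes = int(match.group(1), 16) * SECTOR_BYTES if match else None
    return {
        "slot": int(fields["Slot Number"]),
        "media_errors": int(fields["Media Error Count"]),
        "other_errors": int(fields["Other Error Count"]),
        "predictive_failures": int(fields["Predictive Failure Count"]),
        "smart_alert": fields["Drive has flagged a S.M.A.R.T alert"] == "Yes",
        "raw_bytes": raw_bytes,
    }


summary = summarize(parse_fields(SAMPLE))
print(summary)
```

With the hex sector count and the assumed 512-byte sectors, `raw_bytes` comes out to 600,127,266,816, which agrees with the "User Capacity" line that smartctl reports for the same drive.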

Event Timeline

Thanks @jijiki for creating the task. We no longer create tasks for predictive failures; we let the disks fail so that the task gets created automatically.
We track the predictive failures at T208323: Predictive failures on disk S.M.A.R.T. status
Also, this host, db1069, will soon be decommissioned (hopefully!), see T217396: Decommission db1061-db1073