Page MenuHomePhabricator

db1021 %iowait up
Closed, DeclinedPublic

Description

CPU %iowait is up, RAID battery capacity is down. Confirm cause, possibly replace battery.

Event Timeline

Springle claimed this task.
Springle raised the priority of this task from to Needs Triage.
Springle updated the task description. (Show Details)
Springle added a project: acl*sre-team.
Springle subscribed.

@Springle Btw, do you want a project tag for db related things?

Also see T84050:

Additionally, we're missing all kinds of MegaCli checks, like battery status errors, missing logical drives, predictive errors, different configured from runtime settings (e.g. the usual "configure WriteBack but active is WriteThrough") or even weird statuses such as battery train schedules

fgiunchedi triaged this task as Medium priority.Apr 1 2015, 11:25 AM
fgiunchedi subscribed.
jcrespo subscribed.

The current status of db1021 is Okish. The BBU is not in good state and one disk has errors, but the RAID is functional.

There is no actionable but to dismantle the hardware: T106847 and improve the RAID checks T84050.