Page MenuHomePhabricator

[ceph] Enable disk failure prediciton
Open, HighPublic

Description

It's currently setup on codfw1.

Now that we have device metrics being collected in both clusters, we can enable this module to help detect drives with issues.

It requires installing the package ceph-mgr-diskprediction-local and enabling the module:

ceph mgr module enable diskprediction_local
ceph config set global device_failure_prediction_mode local

Event Timeline

dcaro triaged this task as High priority.
dcaro renamed this task from [ceph] Unable disk failure prediciton to [ceph] Enable disk failure prediciton.Oct 27 2023, 3:46 PM
fnegri changed the status of subtask T306820: [ceph] Upgrade to v16 from Open to In Progress.Jul 15 2025, 1:42 PM
Aklapper subscribed.

@dcaro Removing task assignee as this open task has been assigned for more than two years - See the email sent to task assignee on 2025-11-25.
Please assign this task to yourself again if you still realistically [plan to] work on this task - it would be welcome! :)
If this task has been resolved in the meantime, or should not be worked on by anybody ("declined"), please update its task status via "Add Action… 🡒 Change Status".
Also see https://www.mediawiki.org/wiki/Bug_management/Assignee_cleanup for tips how to best manage your individual work in Phabricator. Thanks!