We discovered in {} that a lot of the hard drive are experiencing bad sectors, but it flew under the radar as the current smartd alerts don't catch it.
This task is to create a node exporter to keep track of the number of bad sectors, and as a side-bonus to create an alert in two cases:
- when the number is relatively high (say >1k) -> warning
- when the number increases -> critical
The runbook should specify that those hard drives will need replacement, and for the increasing one specifically, to hurry up as the disk is degrading.