Followup from a chat with @Volans, there are a few Prometheus-based alerts in Puppet that should be straightforward to migrate to alerting rules (i.e. alerts.git or puppet if for some reason we want to set alerts "administratively" since alerts.git is self-service even for non-root / non-sre).
Specifically:
- modules/profile/manifests/monitoring.pp: monitoring::check_prometheus { 'smart_healthy':
-
modules/profile/manifests/monitoring.pp: monitoring::check_prometheus { 'edac_correctable_errors': -
modules/profile/manifests/monitoring.pp: monitoring::check_prometheus { 'edac_syslog_events': -
modules/profile/manifests/monitoring.pp: monitoring::check_prometheus { 'filesystem_avail_bigger_than_size': - modules/profile/manifests/prometheus/alerts.pp: monitoring::check_prometheus { 'widespread-puppet-agent-fail':
- modules/profile/manifests/prometheus/alerts.pp: monitoring::check_prometheus { 'widespread-puppet-agent-no-resources':
The EDAC checks will be dropped for some ipmi based monitoring
These could be either still valid (e.g. smart_healthy) or could be possibly discarded as not relevant anymore (e.g. filesystem_avail_bigger_than_size)