Page MenuHomePhabricator

Alert in need of triage: SystemdUnitFailed (instance puppetdb2003:9100)
Closed, DuplicatePublic

Description

The alert SystemdUnitFailed has started firing 1 month ago.

Labels
alertname=SystemdUnitFailed
instance=puppetdb2003:9100
name=generate_os_reports.service
prometheus=ops
severity=critical
site=codfw
source=prometheus
team=infrastructure-foundations
Annotations
NameContent
dashboardhttps://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status
descriptiongenerate_os_reports.service on puppetdb2003:9100
runbookhttps://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
summarygenerate_os_reports.service on puppetdb2003:9100
Links

Triage metadata. Do not delete.
fingerprint=892a271dc0d7df9c