Common information
- dashboard: https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status
- runbook: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
- alertname: SystemdUnitFailed
- instance: lists2001:9100
- prometheus: ops
- severity: critical
- site: codfw
- source: prometheus
- team: collaboration-services
Firing alerts
- dashboard: https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status
- description: sync-list-members-global-renamers.service on lists2001:9100
- runbook: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
- summary: sync-list-members-global-renamers.service on lists2001:9100
- alertname: SystemdUnitFailed
- instance: lists2001:9100
- name: sync-list-members-global-renamers.service
- prometheus: ops
- severity: critical
- site: codfw
- source: prometheus
- team: collaboration-services
- Source
- dashboard: https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status
- description: sync-list-members-global-sysops.service on lists2001:9100
- runbook: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
- summary: sync-list-members-global-sysops.service on lists2001:9100
- alertname: SystemdUnitFailed
- instance: lists2001:9100
- name: sync-list-members-global-sysops.service
- prometheus: ops
- severity: critical
- site: codfw
- source: prometheus
- team: collaboration-services
- Source
- dashboard: https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status
- description: sync-list-members-stewards-l.service on lists2001:9100
- runbook: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
- summary: sync-list-members-stewards-l.service on lists2001:9100
- alertname: SystemdUnitFailed
- instance: lists2001:9100
- name: sync-list-members-stewards-l.service
- prometheus: ops
- severity: critical
- site: codfw
- source: prometheus
- team: collaboration-services
- Source
- dashboard: https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status
- description: sync-list-members-stewards-usergroup.service on lists2001:9100
- runbook: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
- summary: sync-list-members-stewards-usergroup.service on lists2001:9100
- alertname: SystemdUnitFailed
- instance: lists2001:9100
- name: sync-list-members-stewards-usergroup.service
- prometheus: ops
- severity: critical
- site: codfw
- source: prometheus
- team: collaboration-services
- Source