Common information
- dashboard: https://grafana.wikimedia.org/d/000000377/host-overview?orgId=1&var-server=cloudvirt1036
- description: Unit wmf_auto_restart_virtlogd.service on node cloudvirt1036 has been down for long.
- runbook: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/SystemdUnitDown
- summary: The systemd unit wmf_auto_restart_virtlogd.service on node cloudvirt1036 has been failing for more than two hours.
- alertname: SystemdUnitDown
- cluster: wmcs
- instance: cloudvirt1036:9100
- job: node
- name: wmf_auto_restart_virtlogd.service
- prometheus: ops
- severity: critical
- site: eqiad
- source: prometheus
- state: failed
- team: wmcs
- type: oneshot
Firing alerts
- dashboard: https://grafana.wikimedia.org/d/000000377/host-overview?orgId=1&var-server=cloudvirt1036
- description: Unit wmf_auto_restart_virtlogd.service on node cloudvirt1036 has been down for long.
- runbook: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Runbooks/SystemdUnitDown
- summary: The systemd unit wmf_auto_restart_virtlogd.service on node cloudvirt1036 has been failing for more than two hours.
- alertname: SystemdUnitDown
- cluster: wmcs
- instance: cloudvirt1036:9100
- job: node
- name: wmf_auto_restart_virtlogd.service
- prometheus: ops
- severity: critical
- site: eqiad
- source: prometheus
- state: failed
- team: wmcs
- type: oneshot
- Source