Page MenuHomePhabricator

SystemdUnitDownForLong cloudcontrol1005:9100 Unit drain_rabbitmq_notification_error.service on node cloudcontrol1005 has been down for long.
Closed, ResolvedPublic

Description

Common information

  • alertname: SystemdUnitDownForLong
  • cluster: wmcs
  • instance: cloudcontrol1005:9100
  • job: node
  • name: drain_rabbitmq_notification_error.service
  • prometheus: ops
  • severity: task
  • site: eqiad
  • source: prometheus
  • state: failed
  • team: wmcs
  • type: simple

Firing alerts


Event Timeline

dcaro claimed this task.

This has been fixed