Page MenuHomePhabricator

ManagementSSHDown
Closed, DuplicatePublic

Description

Common information

  • alertname: ManagementSSHDown
  • instance: lvs2013.mgmt:22
  • job: probes/mgmt
  • module: ssh_banner
  • prometheus: ops
  • rack: C2
  • severity: task
  • site: codfw
  • source: prometheus
  • team: dcops

Firing alerts


  • dashboard: TODO
  • description: The management interface at lvs2013.mgmt:22 has been unresponsive for multiple hours.
  • runbook: https://wikitech.wikimedia.org/wiki/Dc-operations/Hardware_Troubleshooting_Runbook
  • summary: Unresponsive management for lvs2013.mgmt:22
  • alertname: ManagementSSHDown
  • instance: lvs2013.mgmt:22
  • job: probes/mgmt
  • module: ssh_banner
  • prometheus: ops
  • rack: C2
  • severity: task
  • site: codfw
  • source: prometheus
  • team: dcops
  • Source