Page MenuHomePhabricator

Unresponsive management for maps2009.mgmt:22
Closed, ResolvedPublic

Description

Common information

  • alertname: ManagementSSHDown
  • instance: maps2009.mgmt:22
  • job: probes/mgmt
  • module: ssh_banner
  • prometheus: ops
  • rack: B6
  • severity: task
  • site: codfw
  • source: prometheus
  • team: dcops

Firing alerts


  • dashboard: TODO
  • description: The management interface at maps2009.mgmt:22 has been unresponsive for multiple hours.
  • runbook: https://wikitech.wikimedia.org/wiki/Management_Interfaces#Reset_the_management_card
  • summary: Unresponsive management for maps2009.mgmt:22
  • alertname: ManagementSSHDown
  • instance: maps2009.mgmt:22
  • job: probes/mgmt
  • module: ssh_banner
  • prometheus: ops
  • rack: B6
  • severity: task
  • site: codfw
  • source: prometheus
  • team: dcops
  • Source

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

maps2009 idrac continues to fail and requires a main board replacement to fix. The server is due to be refreshed later this quarter (Q1 25-26). Gonna leave the ticket open so that we don't get spam tickets. Please ignore until this server is decommed.

Jhancock.wm claimed this task.

server is decommed.