ps1-d1-eqiad and ps1-d6-eqiad down
Closed, Resolved (Public)

Description

Hi SRE folks,

The two PS mentioned above are down. Please see the conversation below and act as required. I’m not tagging ops-eqiad as I don’t know the impact fully and don’t want it to look like I’m authorising anything.

21:07:28 <icinga-wm> PROBLEM - Host ps1-d1-eqiad is DOWN: PING CRITICAL - Packet loss = 100%

21:07:40 <icinga-wm> PROBLEM - Host ps1-d6-eqiad is DOWN: PING CRITICAL - Packet loss = 100%

21:17:41 <papaul> RhinosF1: it looks like it is not just the pdu, it is all the servers connected to that mgmt switch

Event Timeline

Papaul triaged this task as Medium priority. Mar 20 2023, 9:24 PM
Papaul added a project: ops-eqiad.

Acknowledged, will investigate and update task.

@Cmjohnson I changed the cable out just now but have to step out of the data center. I can continue to look at it later if it's still bad.

Jclark-ctr claimed this task.

Rebooted msw in rack d1, d6. Looks to have recovered.