Page MenuHomePhabricator

mr1-eqsin.oob IPv6 connectivity flapping
Closed, ResolvedPublic0 Story Points

Description

Icinga has been reporting the following alarm flapping from ~ Jul 13th:

PROBLEM - Host mr1-eqsin.oob IPv6 is DOWN: PING CRITICAL - Packet loss = 100%

At the same time, the ripe-atlas-eqsin IPv6 alert has fired too (not flapping though).

From icinga1001 and bast4002 the IPv6 pings are indeed showing a huge packet loss for the mr1's oob IP, but not for the RIPE anchor's one.

Event Timeline

elukey created this task.Jul 14 2019, 8:05 AM
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJul 14 2019, 8:05 AM
elukey triaged this task as High priority.Jul 14 2019, 8:05 AM

Mentioned in SAL (#wikimedia-operations) [2019-07-14T13:18:39Z] <godog> silence mr1-eqsin.oob IPv6 until tomorrow 8 UTC - T227967

ayounsi claimed this task.Jul 14 2019, 11:36 PM

Thanks, email sent to Equinix NOC.
So far I don't think there is a link between the ripe alerts and the oob alerts.

Mentioned in SAL (#wikimedia-operations) [2019-07-15T17:34:18Z] <cdanis> downtime mr1-eqsin.oob IPv6 for 20h T227967

So far I don't think there is a link between the ripe alerts and the oob alerts.

Well, seems like they are, as the return path from mr1 -> icinga1001 goes through HE, nothing we can do there though as Equinix controls that path.

/cc T228015

ayounsi closed this task as Resolved.Jul 15 2019, 9:12 PM

Seems like fixing T228015 fixed that issue as well.