cr4-ulsfo rebooted unexpectedly today at 23:19:34 UTC (about 19:20 in the paste below, whose timestamps are UTC-4).
System booted: 2019-04-16 23:19:34 UTC (00:05:42 ago)
19:20 <+icinga-wm> PROBLEM - Host cr4-ulsfo IPv6 is DOWN: PING CRITICAL - Packet loss = 100%
19:20 <+icinga-wm> PROBLEM - Host cr4-ulsfo is DOWN: PING CRITICAL - Packet loss = 100%
19:20 <+icinga-wm> PROBLEM - Host re0.cr4-ulsfo is DOWN: PING CRITICAL - Packet loss = 100%
19:21 <+icinga-wm> PROBLEM - Router interfaces on cr3-ulsfo is CRITICAL: CRITICAL: host 198.35.26.192, interfaces up: 59, down: 3, dormant: 0, excluded: 0, unused: 0: https://wikitech.wikimedia.org/wiki/Network_monitoring%23Router_interface_down
19:23 < mutante> volans: looks like there was an issue with the link between 3 and 4 before https://phabricator.wikimedia.org/T196030
19:24 <+icinga-wm> RECOVERY - Router interfaces on cr3-ulsfo is OK: OK: host 198.35.26.192, interfaces up: 68, down: 0, dormant: 0, excluded: 0, unused: 0 https://wikitech.wikimedia.org/wiki/Network_monitoring%23Router_interface_down
19:24 <+icinga-wm> RECOVERY - Host cr4-ulsfo is UP: PING OK - Packet loss = 0%, RTA = 75.46 ms
19:25 <+icinga-wm> PROBLEM - PyBal BGP sessions are established on lvs4007 is CRITICAL: 0 le 0 https://grafana.wikimedia.org/dashboard/db/pybal-bgp?var-datasource=ulsfo+prometheus/ops
19:25 < volans> System booted: 2019-04-16 23:19:34 UTC (00:05:42 ago)
19:25 <+icinga-wm> RECOVERY - Host cr4-ulsfo IPv6 is UP: PING OK - Packet loss = 0%, RTA = 75.40 ms
19:25 <+icinga-wm> RECOVERY - Host re0.cr4-ulsfo is UP: PING OK - Packet loss = 0%, RTA = 75.05 ms
19:26 <+icinga-wm> RECOVERY - PyBal BGP sessions are established on lvs4007 is OK: (C)0 le (W)0 le 1 https://grafana.wikimedia.org/dashboard/db/pybal-bgp?var-datasource=ulsfo+prometheus/ops
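
To line up the IRC timestamps with the router's reported boot time, here is a minimal Python sketch. The boot time and uptime are copied from the "System booted" line. The UTC-4 offset for the IRC client is an assumption, inferred from that line being pasted at 19:25, and `to_irc_clock` is a hypothetical helper name.

```python
from datetime import datetime, timedelta, timezone

# Values copied from the "System booted" line in the paste.
boot_utc = datetime(2019, 4, 16, 23, 19, 34, tzinfo=timezone.utc)
uptime = timedelta(minutes=5, seconds=42)

# Assumption: the IRC client that produced the paste displays UTC-4.
irc_tz = timezone(timedelta(hours=-4))


def to_irc_clock(ts):
    """Render a timezone-aware timestamp as HH:MM on the IRC client's clock."""
    return ts.astimezone(irc_tz).strftime("%H:%M")


boot_local = to_irc_clock(boot_utc)            # when the router came up
query_local = to_irc_clock(boot_utc + uptime)  # when the uptime was queried

print(boot_local, query_local)
```

Under that assumption the uptime query lands at 19:25, the minute it was pasted into the channel, and the boot falls about a minute before the first DOWN alerts at 19:20.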