Common information
- dashboard: https://grafana.wikimedia.org/d/O0nHhdhnz/network-probes-overview?var-job=probes/custom&var-module=All
- runbook: https://wikitech.wikimedia.org/wiki/Runbook#gerrit2003:443
- alertname: ProbeDown
- instance: gerrit2003:443
- job: probes/custom
- prometheus: ops
- severity: critical
- site: codfw
- source: prometheus
- team: collaboration-services-releng
Firing alerts
- dashboard: https://grafana.wikimedia.org/d/O0nHhdhnz/network-probes-overview?var-job=probes/custom&var-module=All
- description: gerrit2003:443 failed when probed by http_gerrit_tls_ip4 from codfw. Availability is 0%.
- logs: https://logstash.wikimedia.org/app/dashboards#/view/f3e709c0-a5f8-11ec-bf8e-43f1807d5bc2?_g=(filters:!((query:(match_phrase:(service.name:http_gerrit_tls_ip4)))))
- runbook: https://wikitech.wikimedia.org/wiki/Runbook#gerrit2003:443
- summary: Service gerrit2003:443 has failed probes (http_gerrit_tls_ip4)
- address: 208.80.153.116
- alertname: ProbeDown
- family: ip4
- instance: gerrit2003:443
- job: probes/custom
- module: http_gerrit_tls_ip4
- prometheus: ops
- severity: critical
- site: codfw
- source: prometheus
- team: collaboration-services-releng
- dashboard: https://grafana.wikimedia.org/d/O0nHhdhnz/network-probes-overview?var-job=probes/custom&var-module=All
- description: gerrit2003:443 failed when probed by http_gerrit_tls_ip6 from codfw. Availability is 0%.
- logs: https://logstash.wikimedia.org/app/dashboards#/view/f3e709c0-a5f8-11ec-bf8e-43f1807d5bc2?_g=(filters:!((query:(match_phrase:(service.name:http_gerrit_tls_ip6)))))
- runbook: https://wikitech.wikimedia.org/wiki/Runbook#gerrit2003:443
- summary: Service gerrit2003:443 has failed probes (http_gerrit_tls_ip6)
- address: 2620:0:860:4:208:80:153:116
- alertname: ProbeDown
- family: ip6
- instance: gerrit2003:443
- job: probes/custom
- module: http_gerrit_tls_ip6
- prometheus: ops
- severity: critical
- site: codfw
- source: prometheus
- team: collaboration-services-releng
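The two firing alerts differ only in address family, so a manual check against each address can help tell a host-wide outage from a single-family problem. The following is a minimal sketch of such a check, not the probe the alert actually runs: it only attempts a TLS handshake on port 443, whereas the `http_gerrit_tls_ip4` and `http_gerrit_tls_ip6` modules presumably also issue an HTTP request. The function names and the `server_name` parameter are hypothetical; the hostname used for SNI and certificate validation is not stated in the alert and has to be supplied by the operator.

```python
import socket
import ssl

# Addresses copied from the two firing alerts above, keyed by their "family" label.
TARGETS = {
    "ip4": "208.80.153.116",
    "ip6": "2620:0:860:4:208:80:153:116",
}
PORT = 443


def format_target(address: str, port: int) -> str:
    """Return 'host:port', wrapping IPv6 literals in brackets."""
    if ":" in address:
        return f"[{address}]:{port}"
    return f"{address}:{port}"


def tls_handshake_ok(address: str, port: int, server_name: str,
                     timeout: float = 5.0) -> bool:
    """Return True if a verified TLS handshake with the address succeeds.

    server_name is an assumption supplied by the caller; it is used for SNI
    and for certificate hostname verification.
    """
    context = ssl.create_default_context()
    try:
        with socket.create_connection((address, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=server_name):
                return True
    except OSError:
        # Covers refused connections, timeouts, and TLS errors
        # (ssl.SSLError is a subclass of OSError).
        return False


def check_targets(server_name: str) -> dict:
    """Attempt a handshake per address family; map family -> success flag."""
    return {
        family: tls_handshake_ok(address, PORT, server_name)
        for family, address in TARGETS.items()
    }
```

Calling `check_targets("<hostname served by gerrit2003>")` returns a dictionary with one boolean per family. If both entries are `False`, that is consistent with the 0% availability reported for both modules, though the result also depends on whether the machine running the check can reach codfw over each family.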