Page MenuHomePhabricator

ProbeDown - *_experimental_wikidata_org
Closed, ResolvedPublic

Description

Common information

  • alertname: ProbeDown
  • family: ip4
  • job: probes/custom
  • prometheus: ops
  • severity: task
  • source: prometheus
  • team: collaboration-services

Firing alerts







Event Timeline

Dzahn renamed this task from ProbeDown to ProbeDown - *_experimental_wikidata_org.Jan 18 2024, 12:37 AM

These are new experimental sites added as part of T354658.

Dzahn added a subscriber: bking.

We talked about this and https://gerrit.wikimedia.org/r/c/operations/puppet/+/991680 was merged which disables these sites again for now, until Monday.

So the alert should be gone now.

Then we will have a special meeting about adding the sites to the legacy miscweb TSL cert.

prio low, but keeping open.

let's close it once the sites are added back and have been added to the cert.

we can use this to track the work for that.

should be resolved tomorrow once we added the new sites to the envoy TLS certs. This will happen in the team office hours.

the fix for this will be T355593 which is kind of a duplicate ticket but more specific about the root cause

Dzahn claimed this task.

The missing SANs have now been added to the envoy certs and we could confirm the new sites are working now.