T423027: 2026-04-12 Gerrit Outage (was: DiskSpace) was filed at 10:08 UTC and the first user report and became an outage at around 14:15 UTC. It didn't trigger a paging alert until
14:38:51 <+jinxer-wm> FIRING: [2x] ATSBackendErrorsHigh: ATS: elevated 5xx errors from gerrit.discovery.wmnet in eqiad #_page - https://wikitech.wikimedia.org/wiki/Apache_Traffic_Server#Debugging - https://alerts.wikimedia.org/?q=alertname%3DATSBackendErrorsHigh
Note: It was also rapidly rising in disk space from around midnight UTC so there was a 10 hour gap where something wrong could have been detected even earlier.
That's around 15 minutes after the first user report and 4.5 hours after automated monitoring detected a problem. It probably should have gone off a bit louder given Gerrit is a fairly critical part of Infrastructure and it caused secondary alerting from at least authdns-update failing and CI issues.