If we wish to be able to act to ensure that our service it up it is valuable to both monitor what fraction of the time it is down as well as alert us in the event this happens.
These should alert to the existing monitoring email.
Some wbstack.com alerts that were setup in terraform can be found at https://github.com/wbstack/deploy/blob/main/tf/monitoring_alert_policy.tf
Useful links:
- Defining uptime checks https://cloud.google.com/monitoring/uptime-checks/introduction
- Notification channels https://cloud.google.com/monitoring/support/notification-options
- Alerting https://cloud.google.com/monitoring/alerts
A/C:
- Monitoring exists to check the uptime of all our public facing services
- Alerting exists to notify developers in a real time way to wb-cloud-monitoring@wikimedia.de of one of these public facing services has been unavailable for 10 mins.