As part of progressively reducing Icinga' scope we should be moving off it all paging checks/alerts. This will also help improving paging alerts reliability (e.g. {T294166}) because we'll be using the VO API exclusively, as opposed to the email transport.
== List of current (April 2022) paging alerts in Icinga ==
=== Prometheus-based (via Icinga `check_prometheus`) ===
* [x] excessive RX traffic on LVS interfaces
* [x] not enough php-fpm workers
* [x] reduced availability (i.e. high 5xx) for ats-tls and varnish
* [x] high rate of NEL errors
=== Native Icinga/NRPE checks ===
* [ ] zookeeper server
* [x] LVS/service::catalog checks. Will be removed by {T291946}
* [ ] MariaDB alerts (replica, disk space, read only, mysqld processes not running, etc)
* [ ] cfssl signer per-CA and cfssl-multirootca unit status
* [ ] acme-chief unit status
* [ ] Corp OIT ldap mirror
* [ ] etcd replication
* [ ] kafka broker server
* [x] exim queue
* [x] fastnetmon is alerting
* [ ] phabricator.wikimedia.org unreachable / ssl expiring
* [ ] ircd
* [ ] auth and recursive DNS
* [ ] elasticsearch health check for frozen writes
* [ ] "wiki content on commons" (and ssl expiry)
* [ ] superset (tcp/http) check
Note some users' (e.g. WMCS, fundraising) checks will be tackled as a separate effort