Maybe as daemonsets, some things we would like to check are:
- NFS is behaving
- DNS is responding reliably
- We have connectivity to the internet
Maybe as daemonsets, some things we would like to check are:
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| Resolved | • aborrero | T380882 openstack network problems (November 2024) | |||
| Open | None | T380985 [infra,k8s,o11y] Introduce worker checks |
I believe this is duplicate of T380892: [infra,k8s,o11y] introduce additional observability for calico and general networking
It's a bit wider, as the idea is to also test NFS and other things, not just calico/network (might be a parent task of that one?)
This might be a duplicated of the old T242637: Create a "health check" for Kubernetes worker nodes which validates local Toolforge config