One log to rule them all.
It would be good to have logstash for at least tools-ops logs, which includes
- basic system logging (dmesg / systemd)
- k8s logs (maybe not tools, but kube-system/toolforge components at least)
- toolforge component deployment actions
- maintain-harbor (that might be logging to the worker's local filesystem currently, we might not need to do that anymore if we can pull the logs directly from k8s, T383081: Persist important toolforge k8s components logs)
Systemd would include most (if not all) the VM services (ssh, puppet, redis, nginx, ...).
Being able then to search through it (kibana style) would help enormously to debug issues, specially for k8s components that might have the logs only in the pods or locally on the workers.