an-worker1132 is raising multiple alerts like the one below:
PROBLEM - Check systemd state on an-worker1132 is CRITICAL: CRITICAL - degraded: The following units failed: export_smart_data_dump.service https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
CPU load on the host is indeed high, which seems to be linked to very slow disks: https://grafana.wikimedia.org/d/000000377/host-overview?orgId=1&var-server=an-worker1132&var-datasource=thanos&var-cluster=analytics&from=1677659659968&to=1677694130819
We need to investigate the root cause and remove the node from the cluster.
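
As a possible starting point for triage, the failed units can be pulled out of the alert text so that repeated alerts can be grouped by unit. This is only a minimal sketch: the helper name `failed_units` is hypothetical, and it assumes that multiple units, if present, are separated by commas or whitespace and that the documentation URL always comes last, as in the alert quoted above.

```python
import re


def failed_units(alert: str) -> list[str]:
    """Return the systemd unit names listed in a check_systemd_state alert.

    Assumption: units follow the phrase "The following units failed:" and
    are separated by commas and/or whitespace, with a trailing docs URL.
    """
    match = re.search(r"The following units failed:\s*(.+)", alert)
    if match is None:
        return []
    units = []
    for token in re.split(r"[,\s]+", match.group(1)):
        # Stop at the documentation URL appended to the alert text.
        if token.startswith("http"):
            break
        if token:
            units.append(token)
    return units


alert = (
    "PROBLEM - Check systemd state on an-worker1132 is CRITICAL: "
    "CRITICAL - degraded: The following units failed: "
    "export_smart_data_dump.service "
    "https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state"
)
print(failed_units(alert))  # ['export_smart_data_dump.service']
```

Here the only failing unit is export_smart_data_dump.service, which would be consistent with the slow-disk hypothesis, but that link still needs to be confirmed on the host itself.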