Icinga alerted for replica lag on clouddb1017 (analytics s1) and wmf-pt-kill for the same:
Notification Type: PROBLEM
Service: Check systemd state
Host: clouddb1017
Address: 10.64.32.61
State: CRITICAL
Date/Time: Thu Sept 9 02:28:37 UTC 2021
Notes URLs: https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state
Acknowledged by :
Additional Info:
CRITICAL - degraded: The following units failed: wmf-pt-kill@s1.service