The host got reimaged:
2021-01-27
01:25 <legoktm@cumin1001> conftool action : set/pooled=yes; selector: name=mw2295.codfw.wmnet [production]
01:20 <legoktm@cumin1001> conftool action : set/pooled=no; selector: name=mw2295.codfw.wmnet [production]
00:52 <legoktm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2295.codfw.wmnet with reason: REIMAGE [production]
00:49 <legoktm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mw2295.codfw.wmnet with reason: REIMAGE [production]
Running scap today fails on this host with an ssh host key verification error:
10:03:59 /usr/bin/sudo -u root -- /usr/local/sbin/check-and-restart-php php7.2-fpm 100 on mw2295.codfw.wmnet returned [255]: Host key verification failed.
10:04:00 1 hosts had failures restarting php-fpm
The last Puppet run was at Wed Jan 27 09:24:45 UTC 2021 (10122 minutes ago). Puppet is disabled.
$ last
hashar pts/0 208.80.153.54 Wed Feb 3 10:07 still logged in
legoktm pts/0 208.80.153.54 Wed Jan 27 01:22 - 01:22 (00:00)
reboot system boot 4.19.0-13-amd64 Wed Jan 27 01:11 still running
reboot system boot 4.19.0-13-amd64 Wed Jan 27 00:46 - 01:08 (00:21)
But it is still in the dsh groups:
/etc/dsh/group/mediawiki-installation:mw2295.codfw.wmnet
/etc/dsh/group/api_appserver:mw2295.codfw.wmnet