We got an icinga/VO notification today that labstore1006/1007 had a systemd unit failure.
The issue seems to be:
Jun 01 05:00:06 labstore1007 systemd[1]: Started Copy mediawiki_history_dumps files from Hadoop HDFS.. Jun 01 05:00:06 labstore1007 kerberos-run-command[12564]: User dumpsgen executes as user dumpsgen the command ['/bin/bash', '-c', '/usr/local/bin/hdfs-rsync -r -t --delete --exclude "readme.html" --chmod=go-w hdfs:///wmf/data/archive/mediawiki/history/{$(/bin/date --date="$(/bin/date +%Y-%m-15) -1 month" +"%Y-%m"),$(/bin/date --date="$(/bin/date +%Y-%m-15) -2 month" +"%Y-%m")} file:///srv/dumps/xmldatadumps/public/other/mediawiki_history/'] Jun 01 05:00:08 labstore1007 kerberos-run-command[12564]: Error: Argument parsing error: Jun 01 05:00:08 labstore1007 kerberos-run-command[12564]: Error validating src list: Jun 01 05:00:08 labstore1007 kerberos-run-command[12564]: hdfs:///wmf/data/archive/mediawiki/history/2020-05 does not exist Jun 01 05:00:08 labstore1007 kerberos-run-command[12564]: Try --help for more information. Jun 01 05:00:08 labstore1007 systemd[1]: analytics-dumps-fetch-mediawiki_history_dumps.service: Main process exited, code=exited, status=1/FAILURE Jun 01 05:00:08 labstore1007 systemd[1]: analytics-dumps-fetch-mediawiki_history_dumps.service: Failed with result 'exit-code'.