The Grid Engine crontab was very stable, with no failures in starting jobs. Since the k8s migration, however, I have been receiving reports from users that bot reports are randomly not being generated: twice within the last 10 days.
From 22 May: job-g13-elig shows a last schedule of 2023-05-21, whereas the other jobs with the same cron schedule show 2023-05-22.
From today (2 June): job-g13-1week was skipped for a day and job-g13-elig was skipped twice!
In all cases, no emails were received, even though --emails onfailure was configured. Since the jobs didn't start at all, the error logs don't contain anything either.
All jobs use only 256Mi of memory and the default 500m CPU allocation (the CPU limit is 5000m for this tool), so resource limits shouldn't be getting hit.
jobs.yml file: https://github.com/siddharthvp/SDZeroBot/blob/master/jobs.yml
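
Since the skipped runs leave no email and no log entry, the only symptom so far is a last schedule date that lags behind the other jobs on the same cron schedule. Below is a minimal sketch of that comparison, assuming the last schedule dates are collected by hand from the job listing and that every job in the mapping shares one schedule. `find_lagging_jobs` and the `other-job` entry are hypothetical names used for illustration; only the job-g13-elig date comes from the 22 May observation above.

```python
from datetime import date


def find_lagging_jobs(last_schedule: dict[str, date]) -> list[str]:
    """Return the jobs whose last schedule date is older than the newest date in the group.

    Assumes every job in the mapping shares the same cron schedule, so a job
    that lags behind the newest date has missed at least one run.
    """
    if not last_schedule:
        return []
    newest = max(last_schedule.values())
    return sorted(name for name, when in last_schedule.items() if when < newest)


# job-g13-elig's date is the one reported on 22 May.
# "other-job" is a hypothetical stand-in for the unnamed jobs on the same schedule.
observed = {
    "job-g13-elig": date(2023, 5, 21),
    "other-job": date(2023, 5, 22),
}

print(find_lagging_jobs(observed))  # prints ['job-g13-elig']
```

This only detects a skip after the fact; it does not explain why the job was never started.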