Page MenuHomePhabricator

[jobs-api,jobs-cli] Expose hidden quota errors more clearly
Open, MediumPublic

Description

With the various Kubernetes abstractions most of Toolforge quota errors happen at the Job or Pod level and so not directly exposed to tool maintainers when creating or editing the tools.

Event Timeline

I suspect updating jobs-framework-emailer to catch out of quota errors is the way to go here.

Does emails: onfailure config for the job result in an email if the job fails to start due to a quota error?

Does emails: onfailure config for the job result in an email if the job fails to start due to a quota error?

It appears there's no clear-cut answer to this. Quota issue isn't immediately causing an error. Job waits around for some time till quota is available and hence only gets delayed - good enough for cron jobs.

dcaro renamed this task from Expose hidden quota errors more clearly to [jobs-api,jobs-cli] Expose hidden quota errors more clearly.Mar 11 2024, 2:26 PM
dcaro triaged this task as Medium priority.
dcaro edited projects, added Toolforge; removed Toolforge Jobs framework.
dcaro moved this task from Backlog to Ready to be worked on on the Toolforge board.