--timeout flag for mwscript-k8s
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	RLazarus
	Tue, Oct 1, 3:14 AM

Description

On wikitech-l, @Dreamy_Jazz points out that sometimes maintenance scripts are run under the timeout command, in order to interrupt them after a set interval. With mwscript-k8s this technique doesn't work, since the maintenance script continues to run after mwscript-k8s terminates.

@CDanis suggests a new command-line flag for mwscript-k8s, which we pipe through the Helm chart to .spec.activeDeadlineSeconds in the Job configuration. (Of course the default will remain to leave activeDeadlineSeconds unset, so scripts run to completion.)

If the timeout is reached, Kubernetes will terminate the job, in state Failed. (That strikes me as correct, since the only option other than Failed is Complete. In this case the job could be called a "successful failure" in that it terminated on schedule, but never completed.) Script owners might have to inspect the job to differentiate between a failure due to timeout and a failure due to some unexpected error, but the rest of the mwscript-k8s apparatus, including cleanup, will work normally.

Details

	Subject	Repo	Branch	Lines +/-
	deployment_server: Add --timeout flag to mwscript-k8s	operations/puppet	production	+22 -0
	mediawiki: Allow setting mwscript job activeDeadlineSeconds	operations/deployment-charts	master	+5 -1

Customize query in gerrit

Related Objects
Search...

Status	Assigned	Task
Open	None	T341560 Migrate mwmaint server functionality to mw-on-k8s
Open	RLazarus	T341553 Allow running one-off scripts manually
Resolved	RLazarus	T376099 --timeout flag for mwscript-k8s

Event Timeline

RLazarus created this task.Tue, Oct 1, 3:14 AM

Restricted Application removed a project: Patch-For-Review. · View Herald TranscriptTue, Oct 1, 3:14 AM

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

ArielGlenn subscribed.Tue, Oct 1, 4:59 AM

Change #1078720 had a related patch set uploaded (by RLazarus; author: RLazarus):

[operations/deployment-charts@master] mediawiki: Allow setting mwscript job activeDeadlineSeconds

https://gerrit.wikimedia.org/r/1078720

Change #1078721 had a related patch set uploaded (by RLazarus; author: RLazarus):

[operations/puppet@production] deployment_server: Add --timeout flag to mwscript-k8s

https://gerrit.wikimedia.org/r/1078721

Change #1078720 merged by jenkins-bot:

[operations/deployment-charts@master] mediawiki: Allow setting mwscript job activeDeadlineSeconds

https://gerrit.wikimedia.org/r/1078720

Change #1078721 merged by RLazarus:

[operations/puppet@production] deployment_server: Add --timeout flag to mwscript-k8s

https://gerrit.wikimedia.org/r/1078721

This is now supported!

--timeout TIMEOUT     Set a deadline for the job, to interrupt it after a set interval. Examples: 1d, 2h, 30m, 40s, 40 -- number without unit is in seconds. (Default: No deadline)

Multichill mentioned this in T377782: Add --timeout to toolforge jobs .Mon, Oct 21, 8:55 PM

--timeout flag for mwscript-k8sClosed, ResolvedPublicActions

Description

Details

Related ObjectsSearch...

Event Timeline

--timeout flag for mwscript-k8s
Closed, ResolvedPublic
Actions

Related Objects
Search...