Maniphest T193661

Alert in -releng when permanent hosts have low disk space
Closed, Resolved
Public

Authored By

	greg
	May 2 2018, 5:55 PM

Description

Let's stop having our alerting be people complaining of test failures.

Related Objects

Mentioned In: T201224: Jenkins should auto-depool nodes if they run out of disk space on specific partitions
Mentioned Here: T201224: Jenkins should auto-depool nodes if they run out of disk space on specific partitions

Event Timeline

Or, as @thcipriani just suggested, let's make a cronjob that deletes the workspaces before it becomes an issue, as that's what we do when it is an issue.

Looking at the RelEng SAL, it seems like all we've been doing is `rm -rf /srv/jenkins-workspace/workspace/*`.

(( $(df --output=pcent /srv | awk -F '%' '!/Use/ {print $1}') > 95 )) && \
    rm -rf /srv/jenkins-workspace/workspace/*

We could cron that, or have a workspace cleanup job, or the jobs could clean up after themselves (which might be the Right Thing). Adding @hashar to see if he has any preferences among the options here.
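
For the cron option, the one-liner above could be wrapped in a small script along these lines. This is only a sketch, not what was deployed: the function names, the default threshold of 95, and the argument order are assumptions made for illustration, and it relies on GNU `df` supporting `--output=pcent`.

```shell
#!/usr/bin/env bash
# Sketch: delete Jenkins workspaces when a partition is nearly full.
# All function names and the default threshold are illustrative assumptions.
set -euo pipefail

# Print the usage percentage (digits only) of the filesystem holding $1.
disk_usage_percent() {
    df --output=pcent "$1" | tail -n 1 | tr -dc '0-9'
}

# Succeed when usage $1 is strictly above threshold $2.
over_threshold() {
    (( $1 > $2 ))
}

# cleanup_if_full MOUNT WORKSPACE_DIR [THRESHOLD]
# Removes everything under WORKSPACE_DIR when MOUNT is above THRESHOLD percent.
cleanup_if_full() {
    local mount=$1 workspace_dir=$2 threshold=${3:-95}
    local used
    used=$(disk_usage_percent "$mount")
    if over_threshold "$used" "$threshold"; then
        echo "usage ${used}% on ${mount} exceeds ${threshold}%, cleaning ${workspace_dir}" >&2
        # ${var:?} aborts if the variable is empty, so this never expands to "/*".
        rm -rf -- "${workspace_dir:?}"/*
    fi
}

# Only act when called with arguments, e.g.:
#   <script> /srv /srv/jenkins-workspace/workspace
if (( $# >= 2 )); then
    cleanup_if_full "$@"
fi
```

Saved under a hypothetical path such as /usr/local/bin/clean-workspaces, a crontab entry like `*/15 * * * * /usr/local/bin/clean-workspaces /srv /srv/jenkins-workspace/workspace` would run the check every 15 minutes. Note that this removes workspaces of builds that are still running, which is one reason having jobs clean up after themselves may be the better option.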

MarcoAurelio subscribed. May 2 2018, 9:00 PM

• Vvjjkkii renamed this task from Alert in -releng when permanent hosts have low disk space to urdaaaaaaa. Jul 1 2018, 1:12 AM

• Vvjjkkii added projects: CheckUser, Connected-Open-Heritage-Batch-uploads (RAÄ-KMB_1_2017-02), Tamil-Sites, Gamepress, Hashtags, Jade, KartoEditor, Language-2018-Apr-June, New-Editor-Experiences, Mail, TCB-Team (now WMDE-TechWish).

• Vvjjkkii updated the task description.

• Vvjjkkii removed a subscriber: MarcoAurelio.

thcipriani renamed this task from urdaaaaaaa to Alert in -releng when permanent hosts have low disk space. Jul 1 2018, 5:06 PM

thcipriani removed projects: TCB-Team (now WMDE-TechWish), Mail, New-Editor-Experiences, Language-2018-Apr-June, KartoEditor, Jade, Hashtags, Gamepress, Tamil-Sites, Connected-Open-Heritage-Batch-uploads (RAÄ-KMB_1_2017-02), CheckUser.

thcipriani updated the task description.

thcipriani added a subscriber: MarcoAurelio.

thcipriani mentioned this in T201224: Jenkins should auto-depool nodes if they run out of disk space on specific partitions. Aug 6 2018, 3:59 PM

Done as part of T201224. We now automatically depool the faulty slaves and have an IRC notification.
