Page MenuHomePhabricator

Alert in need of triage: KubernetesWorkerUnschedulable
Closed, ResolvedPublic

Description

The alert KubernetesWorkerUnschedulable has started firing 1 month ago.

Labels
alertname=KubernetesWorkerUnschedulable
node=wikikube-worker2100.codfw.wmnet
prometheus=k8s
severity=warning
site=codfw
source=prometheus
team=sre
Annotations
NameContent
dashboardhttps://grafana.wikimedia.org/d/G8zPL7-Wz/kubernetes-node?orgId=1&var-dc=codfw%20prometheus%2Fk8s&var-instance=wikikube-worker2100.codfw.wmnet
descriptionKubernetes worker wikikube-worker2100.codfw.wmnet has been unschedulable for >= 24h k8s@codfw
runbookhttps://wikitech.wikimedia.org/wiki/Kubernetes
summaryKubernetes worker wikikube-worker2100.codfw.wmnet has been unschedulable for >= 24h k8s@codfw
Links

Triage metadata. Do not delete.
fingerprint=4d20cd61d838f7dd

Event Timeline

This host was set aside for mw-experimental work by @jijiki, I'll silence the alert for a month.

jijiki changed the task status from Open to Stalled.Aug 1 2025, 11:00 AM

sorry folks, host's number is up for retirement, my bad. tx @Clement_Goubert

Just a heads up that the alert fired again, can it be silenced for another month?

Just a heads up that the alert fired again, can it be silenced for another month?

Silencing for 1 month, decoms are in progress.

Clement_Goubert changed the task status from Stalled to In Progress.Jan 16 2026, 3:32 PM
Clement_Goubert assigned this task to jasmine_.
Clement_Goubert added a subscriber: jasmine_.

@jasmine_ please resolve this task when done with T409104 T409103

@jasmine_ please resolve this task when done with T409104 T409103

Done, thanks all~