Monitor the load on the Kubernetes API masters
Closed, Resolved (Public)

Description

We will be creating a lot of Pods while the dumps are running, although the number of concurrent Pods will be limited by the Airflow pool parallelism setting.
We need to keep an eye on the load on the Kubernetes API masters, to make sure we're not causing cluster-wide performance degradation.
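
As a rough illustration of what "keeping an eye on the load" could look like, here is a minimal sketch that builds two PromQL expressions from the standard API server metrics `apiserver_request_duration_seconds_bucket` and `apiserver_request_total`. The helper names, the 5m window, and the choice of quantile are assumptions made for illustration; they are not taken from this task or from the dashboards actually in use.

```python
def apiserver_latency_query(quantile: float = 0.99, window: str = "5m") -> str:
    """Build a PromQL expression for API server request latency, per verb.

    Hypothetical helper: the quantile and window defaults are illustrative.
    """
    if not 0 < quantile < 1:
        raise ValueError("quantile must be strictly between 0 and 1")
    return (
        f"histogram_quantile({quantile}, "
        f"sum by (le, verb) "
        f"(rate(apiserver_request_duration_seconds_bucket[{window}])))"
    )


def apiserver_error_ratio_query(window: str = "5m") -> str:
    """Build a PromQL expression for the share of API requests returning 5xx.

    Hypothetical helper: a rising ratio would suggest the masters are struggling.
    """
    return (
        f'sum(rate(apiserver_request_total{{code=~"5.."}}[{window}])) '
        f"/ sum(rate(apiserver_request_total[{window}]))"
    )


if __name__ == "__main__":
    # Print the expressions so they can be pasted into a Prometheus or Grafana query box.
    print(apiserver_latency_query())
    print(apiserver_error_ratio_query())
```

Comparing these values before and during a dumps run would show whether the extra Pod churn is noticeably affecting the API servers. Long-lived WATCH requests tend to skew the latency figure, so it may be worth filtering them out.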

Event Timeline

brouberol triaged this task as Medium priority.
BTullis subscribed.

We have monitored the first full-sized dumps runs and it seems that the load on the Kubernetes masters is within tolerance.
There are some issues around scheduling timeouts for pods, and some latencies tracked in T396738: Increase threshold for KubeletOperationalLatency for dse-k8s-eqiad, but I think that we can address those independently of this ticket.

BTullis claimed this task.