
Investigate cpu/ram requests and limits for DaemonSets pods
Closed, DeclinedPublic

Description

While adding the new nodes for T244791: Scale up 2020 Kubernetes cluster for final migration of legacy cluster workloads I noticed that an "empty" worker has about 10% of available CPU and 13% of available RAM consumed by the calico, kube-proxy, and cadvisor pods. This feels like a lot of overhead on each worker for an "idle" state.

  • Calico pods are requesting 250m CPU with no explicit RAM request and no explicit limit on CPU or RAM.
  • Kube-proxy pods have no explicit Request or Limit values in the Pod template.
  • Cadvisor pods request 150m CPU and 200Mi RAM with 300m CPU and 2000Mi RAM limits.

Can any of these Request values be tuned downward? Can reasonable Limit values be set for everything?
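
For reference, the declared values above come straight from the DaemonSet pod templates and can be re-checked with kubectl. A minimal sketch, assuming the DaemonSets are named calico-node, kube-proxy, and cadvisor and live in the namespaces shown (names and namespaces are assumptions and may differ in our deployment):

  # Print the resources stanza of each DaemonSet's pod template
  kubectl -n kube-system get daemonset calico-node \
    -o jsonpath='{.spec.template.spec.containers[*].resources}'
  kubectl -n kube-system get daemonset kube-proxy \
    -o jsonpath='{.spec.template.spec.containers[*].resources}'
  kubectl -n metrics get daemonset cadvisor \
    -o jsonpath='{.spec.template.spec.containers[*].resources}'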

Event Timeline

Reedy renamed this task from Invesitgate cpu/ram requests and limits for DaemonSets pods to Investigate cpu/ram requests and limits for DaemonSets pods.Feb 14 2020, 6:24 AM

I absolutely do not want to tune down the request values on cluster-critical pods, regarding Calico and kube-proxy. One of the design problems in the old cluster is that the kernel will sacrifice workloads that aren't in the cluster (like flannel) to save things that are in the cluster (nearly all user workloads). The ingress controllers are requested at 1GB RAM and, I think, 1 CPU, for instance. Without that they were scheduled terribly and caused early cluster failures. I kind of want to go the other way and reserve some RAM for Calico as well.
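
If we did go the other way and reserve RAM for Calico explicitly, it could look something like this (the calico-node DaemonSet name, the kube-system namespace, and the 256Mi figure are assumptions here, not decided values):

  # Keep the existing 250m CPU request and add an explicit memory request
  kubectl -n kube-system set resources daemonset calico-node \
    --containers=calico-node \
    --requests=cpu=250m,memory=256Mi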

All that said, kube-proxy's lack of limits doesn't seem like a problem to me, because the limits will be enforced by basically having the node commit suicide, if they are enforced at all. Kube-proxy is generally pretty light since it is just a firewall manager, but if it starts consuming all the resources, we may as well let it: the node will be hosed either way. It will be hosed harder if Calico collapses than if kube-proxy does, but either way, webservices will stop working there.

Cadvisor on the other hand, we can do things with. It's just there for monitoring, not some critical function that keeps the cluster running. 150m is an extremely small request, though.

Question: is that CPU and RAM measured with metrics-server (kubectl top)? If so, that's what these things need. If we restrict it, they just fall into a kill loop.
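
For reference, a quick way to eyeball what these pods consume at rest, assuming metrics-server is deployed and that cadvisor lives in a namespace called metrics:

  # Resting usage of the kube-system DaemonSet pods, as reported by metrics-server
  kubectl top pod -n kube-system --sort-by=memory
  # Same for cadvisor, assuming a "metrics" namespace
  kubectl top pod -n metrics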

I absolutely do not want to tune down the request values on cluster-critical pods, regarding Calico and kube-proxy.

Fair. This is why I phrased the task as investigate and not some direct call for remediation action.

Cadvisor on the other hand, we can do things with.

The 2000Mi RAM hard limit was the more concerning number to me for this one. With only 8G of total RAM per node, including space for the kernel, 2G for metrics collection feels excessive. I know hard limits and requests are not the same thing, but limits are there, to some extent, to help us tune what gets evicted first when something misbehaves.

Question: is that CPU and RAM measured with metrics-server (kubectl top)? If so, that's what these things need. If we restrict it, they just fall into a kill loop.

It is measuring with the metrics.k8s.io/v1beta1 API which I think is functionally equivalent to kubectl top. See https://tools.wmflabs.org/k8s-status/nodes/ and its drill-down pages for where I was reading values from.
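
The same numbers can also be pulled straight from the API if anyone wants to double-check them outside of the k8s-status tool; a rough sketch (jq is only there for readability):

  # Raw node and pod usage from the metrics API (same data kubectl top reports)
  kubectl get --raw /apis/metrics.k8s.io/v1beta1/nodes | jq .
  kubectl get --raw /apis/metrics.k8s.io/v1beta1/namespaces/kube-system/pods | jq .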

I absolutely do not want to tune down the request values on cluster-critical pods, regarding Calico and kube-proxy.

Fair. This is why I phrased the task as investigate and not some direct call for remediation action.

Legit. My initial reply was more like "EEP!", and this was the toned-down version 😉

The 2000Mi RAM hard limit was the more concerning number to me for this one. With only 8G of total RAM per node, including space for the kernel, 2G for metrics collection feels excessive. I know hard limits and requests are not the same thing, but limits are there, to some extent, to help us tune what gets evicted first when something misbehaves.

Yeah, that seems high. I also wonder if we shouldn't be building bigger nodes with smaller disks for Kubernetes in Toolforge. 1G seems like it should be more than enough, no? I wonder what they consume under load. I also am now curious whether that's the default from upstream or just a value we selected when we deployed it. All worth checking.
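
If usage under load bears that out, dropping the limit would be a one-liner along these lines (the cadvisor DaemonSet name, the metrics namespace, and the 1Gi figure are all assumptions to verify first):

  # Keep the current requests, halve the memory limit from 2000Mi to 1Gi
  kubectl -n metrics set resources daemonset cadvisor \
    --requests=cpu=150m,memory=200Mi \
    --limits=cpu=300m,memory=1Gi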

It is measuring with the metrics.k8s.io/v1beta1 API which I think is functionally equivalent to kubectl top. See https://tools.wmflabs.org/k8s-status/nodes/ and its drill-down pages for where I was reading values from.

Ah, so that unfortunately would represent what these things actually consume at rest. 😬

bd808 triaged this task as Low priority.Feb 25 2020, 5:06 PM
bd808 moved this task from Inbox to Soon! on the cloud-services-team (Kanban) board.

We decided to decline this task in the backlog grooming meeting.