For T348758: [jobs-api,jobs-cli] Support services in jobs, we should increment the default services quota from the current 1 to some TBD higher value.
Description
Details
Title | Reference | Author | Source Branch | Dest Branch
---|---|---|---|---
maintain-kubeusers: bump to 0.0.131-20240525201329-ca173bf3 | repos/cloud/toolforge/toolforge-deploy!284 | project_1317_bot_df3177307bed93c3f34e421e26c86e38 | bump_maintain-kubeusers | main
[maintain-kubeusers] increment default services quota | repos/cloud/toolforge/maintain-kubeusers!25 | raymond-ndibe | increase_services_default_quota | main
Event Timeline
> to some TBD higher value.
How about 16 to match the number of pods in the default quota? We can certainly pick any other arbitrary number >1 as well, but this at least has some rationale.
I find it pretty unlikely that a typical tool will exhaust either the Pod or Service quota at 16; I think the most typical tool will continue to be a webservice that consumes one Pod and one Service. In the spirit of T306324: Consider improving quota workflow, I think we should set the default limits quite high relative to expected use where possible, so that we don't end up discouraging folks from innovating by making them stop to ask for permission to use the platform.
I thought the plan was to only allow services for continuous jobs (which webservice will probably become entangled with soon)? If that's the case, then we should make the services quota the same as the deployments quota, which is currently 3. We can also increase this, but at least the services and deployments quotas should stay in lockstep.
```
$ kubectl describe quota
Name:                   tool-wikibugs
Namespace:              tool-wikibugs
Resource                Used    Hard
--------                ----    ----
configmaps              2       10
count/cronjobs.batch    0       50
count/deployments.apps  6       6
count/jobs.batch        0       15
limits.cpu              3       8
limits.memory           3Gi     8Gi
persistentvolumeclaims  0       0
pods                    6       16
requests.cpu            1375m   4
requests.memory         1536Mi  4Gi
secrets                 21      64
services                2       2
services.nodeports      0       0
```
I guess wikibugs has had a quota bump for this already (count/deployments.apps 6) and is currently bumping up against the raised limit too. Why do we care how many Deployments a namespace has as long as the CPU and RAM quotas allow the Pods they manage?
We need to have some quota in place to prevent a misbehaving tool from taking kube-apiserver down by creating hundreds or thousands of unfillable ReplicaSets (a similar thing has happened in the past, see T301081), but I have no objections to raising the deployment quota to match the pod one for example.
+1 on using the same number as pods for deployments/services; it seems like you'd always want to back a Deployment with at least one Pod, and a Service with at least one Deployment (though we might eventually want to let people use more ExternalName-type Services; I don't have any use case in mind, so we might want to wait until then to care about them).
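Aligning the Service and Deployment quotas with the Pod quota, as discussed above, would amount to something like the following ResourceQuota sketch. This is an illustrative fragment, not the actual maintain-kubeusers template; the object name and the values other than `services: 16` and `count/deployments.apps: 16` are assumptions based on the `kubectl describe quota` output quoted earlier.

```yaml
# Hypothetical default quota for a tool namespace, with the Service and
# Deployment limits raised to match the existing Pod limit of 16.
apiVersion: v1
kind: ResourceQuota
metadata:
  name: tool-example          # assumed name; per-tool in practice
  namespace: tool-example
spec:
  hard:
    pods: "16"
    services: "16"            # raised from the old default of 1
    count/deployments.apps: "16"  # raised to stay in lockstep with services
    services.nodeports: "0"
```

A quota like this still bounds how many objects a misbehaving tool can create (the concern raised above about flooding kube-apiserver), while leaving typical webservice tools, which use one Pod and one Service, far below the limits.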
raymond-ndibe opened https://gitlab.wikimedia.org/repos/cloud/toolforge/maintain-kubeusers/-/merge_requests/25
[maintain-kubeusers] increment default services quota
raymond-ndibe merged https://gitlab.wikimedia.org/repos/cloud/toolforge/maintain-kubeusers/-/merge_requests/25
[maintain-kubeusers] increment default services quota
project_1317_bot_df3177307bed93c3f34e421e26c86e38 opened https://gitlab.wikimedia.org/repos/cloud/toolforge/toolforge-deploy/-/merge_requests/284
maintain-kubeusers: bump to 0.0.131-20240525201329-ca173bf3
raymond-ndibe merged https://gitlab.wikimedia.org/repos/cloud/toolforge/toolforge-deploy/-/merge_requests/284
maintain-kubeusers: bump to 0.0.131-20240525201329-ca173bf3