⚓ T113806 Add monitoring and capacity planning for Nodepool

	Subject	Repo	Branch	Lines +/-
	nodepool: use nrpe:: class for monitoring	operations/puppet	production	+1 -3
	nodepool: monitor nodepoold is present	operations/puppet	production	+8 -0

hashar raised the priority of this task from to Needs Triage.

hashar updated the task description. (Show Details)

Will fill sub tasks eventually. Some I can pair them with @zeljkofilipin for puppet level up :-}

Change 244171 had a related patch set uploaded (by Hashar):
nodepool: monitor nodepoold is present

hashar set Security to None.

Change 244171 merged by Andrew Bogott:
nodepool: monitor nodepoold is present

Change 244229 had a related patch set uploaded (by Hashar):
nodepool: use nrpe:: class for monitoring

Change 244229 merged by Andrew Bogott:
nodepool: use nrpe:: class for monitoring

I think it is good enough for now. https://grafana.wikimedia.org/dashboard/db/nodepool has much of what I wanted.

Reopening. Would need some notifications when pool is exhausted, server side errors, and leaked instances (or alien instances).

hashar removed hashar as the assignee of this task.Dec 14 2016, 8:50 PM

hashar triaged this task as Low priority.Jun 16 2017, 11:07 AM

We're migrating away (see eg T187797), no need to do this now.

Add monitoring and capacity planning for Nodepool
Closed, DeclinedPublic
Actions