poolcounter-exporter upgrade
Open, Needs Triage, Public

Assigned To

Authored By

	fgiunchedi
	Apr 4 2023, 12:19 PM

Description

While investigating T309182: Evaluate 'pint' for Prometheus alerts, pint flagged a problem with the poolcounter alerts:

pint_problem{filename="/srv/alerts/ops/team-sre_poolcounter.yaml", instance="localhost:9123", job="pint", kind="alerting", name="PoolcounterFullQueues", problem="rate() should only be used with counters but "poolcounter_full_queues" is a gauge according to metrics metadata from prometheus "ops" at http://127.0.0.1:9900/ops", prometheus="ops", reporter="promql/rate", severity="bug", site="codfw"}

And indeed, poolcounter_full_queues is still a gauge in the version we have deployed; upstream has since turned it into a counter in this commit: https://github.com/Wikia/poolcounter-prometheus-exporter/commit/f7bc8a2e13dff8e94bb6531cd523d92e9c0621d2
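
To illustrate why pint treats rate() on a gauge as a bug, here is a minimal Python sketch of counter-style increase logic. It is a simplification, not Prometheus's actual implementation: it ignores window extrapolation, and the function name and sample values are made up for illustration. The point is that any drop in value is interpreted as a counter reset, which is wrong for a gauge that legitimately goes down.

```python
def simple_increase(samples):
    """Counter-style increase over a window of samples.

    Simplified sketch: real Prometheus also extrapolates to the
    window boundaries, which is omitted here.
    """
    total = 0.0
    for prev, cur in zip(samples, samples[1:]):
        if cur < prev:
            # A drop is assumed to be a counter reset, so the
            # counter is taken to have restarted from zero.
            total += cur
        else:
            total += cur - prev
    return total


# Hypothetical samples, for illustration only.
gauge_samples = [5, 3, 4, 0, 2]    # a gauge that moves up and down
counter_samples = [5, 5, 6, 6, 8]  # a monotonically increasing counter

gauge_increase = simple_increase(gauge_samples)      # 6.0, although the net change is -3
counter_increase = simple_increase(counter_samples)  # 3.0, matching 8 - 5
```

With the gauge samples the computed "increase" is positive even though the value fell overall, so an alert built on rate() of a gauge can fire or stay silent for the wrong reasons.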

Updating our version of poolcounter-prometheus-exporter will fix the problem. cc @akosiaris, what do you think?

Related Objects

Status	Assigned	Task
		Restricted Task
Resolved	Joe	T83729 Add monitoring of poolcounter service
Resolved	fgiunchedi	T309182 Evaluate 'pint' for Prometheus alerts
Open	akosiaris	T333947 poolcounter-exporter upgrade