Fix cloudmetrics icinga prometheus check
Closed, Resolved, Public

Assigned To

	• JHedden

Authored By

	• JHedden
	Jan 10 2020, 8:25 PM

Description

The name-based virtual host configuration on the cloudmetrics servers is not working correctly and is tripping the Icinga alarm "Prometheus cloudmetrics1001/labs restarted: beware possible monitoring artifacts".

Cloudmetrics uses the default Prometheus Apache vhost configuration. Because other vhosts are defined for the Grafana labs instance and no host name is defined for Prometheus, name-based vhost matching does not work as expected.
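The fallback behaviour behind this kind of failure can be sketched in Python. This is a simplified model, not the actual Apache implementation or the cloudmetrics configuration: the `VHost` class, the `select_vhost` function, and the `example.org` host names are all hypothetical, and wildcard aliases are not modelled. It relies on one documented Apache rule: when no `ServerName` or `ServerAlias` matches the request's `Host` header, the first vhost defined for that address and port handles the request.

```python
from dataclasses import dataclass, field


@dataclass
class VHost:
    """Hypothetical stand-in for one Apache <VirtualHost> block."""
    label: str
    server_name: str
    server_aliases: list = field(default_factory=list)


def select_vhost(vhosts, host_header):
    """Pick a vhost roughly the way name-based matching does for one IP:port."""
    # Drop an optional ":port" suffix and compare case-insensitively.
    host = host_header.split(":")[0].lower()
    for vhost in vhosts:
        names = [vhost.server_name] + vhost.server_aliases
        if host in (name.lower() for name in names):
            return vhost
    # Nothing matched: the first vhost defined acts as the default.
    return vhosts[0]


# Assumed example: only a Grafana vhost has a name, so a request for the
# server's own FQDN falls through to it.
vhosts = [VHost("grafana", "grafana.example.org")]
print(select_vhost(vhosts, "cloudmetrics1001.example.org").label)

# Adding a vhost named after the FQDN gives that request its own match.
vhosts.append(VHost("prometheus", "cloudmetrics1001.example.org"))
print(select_vhost(vhosts, "cloudmetrics1001.example.org").label)
```

Under these assumptions the first lookup lands on the Grafana vhost and the second on the Prometheus one, which is one way the symptom described above could arise.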

Details

	Subject	Repo	Branch	Lines +/-
	labs prometheus: convert apache config to template	operations/puppet	production	+1 -1
	labs prometheus: only bind localhost and update vhost config	operations/puppet	production	+16 -1

Related Objects

Mentioned In: T148669: Proxy calls on Labs Grafana get 403 while not logged in

Event Timeline

• JHedden created this task. Jan 10 2020, 8:25 PM

Restricted Application added a subscriber: Aklapper. Jan 10 2020, 8:25 PM

• JHedden triaged this task as Low priority. Jan 10 2020, 8:25 PM

Mentioned in SAL (#wikimedia-operations) [2020-01-10T20:29:53Z] <jeh> cloudmetrics100[12] schedule downtime until Feb 28 2020 on prometheus check T242460

• JHedden claimed this task. Jan 15 2020, 8:45 PM

Change 565113 had a related patch set uploaded (by Jhedden; owner: Jhedden):
[operations/puppet@production] labs prometheus: only bind localhost and update vhost config

https://gerrit.wikimedia.org/r/565113

gerritbot added a project: Patch-For-Review. Jan 15 2020, 8:53 PM

Change 565113 merged by Jhedden:
[operations/puppet@production] labs prometheus: only bind localhost and update vhost config

https://gerrit.wikimedia.org/r/565113

Change 565125 had a related patch set uploaded (by Jhedden; owner: Jhedden):
[operations/puppet@production] labs prometheus: convert apache config to template

https://gerrit.wikimedia.org/r/565125

Change 565125 merged by Jhedden:
[operations/puppet@production] labs prometheus: convert apache config to template

https://gerrit.wikimedia.org/r/565125

• JHedden moved this task from Inbox to Doing on the cloud-services-team (Kanban) board. Jan 15 2020, 9:22 PM

I updated Prometheus to bind only to the loopback interface and configured Apache to proxy requests for the server's FQDN to Prometheus. These changes bring the cloudmetrics configuration in line with production and clear up the Icinga errors when checking this service.
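A quick way to exercise the proxied path by hand might look like the sketch below. It only builds the request and does not claim to reproduce what the Icinga check does. The `build_probe` helper and the `example.org` host name are hypothetical, the `/labs` path prefix is an assumption taken from the alarm text, and plain HTTP on the default port is assumed; `/api/v1/query` is the standard Prometheus instant-query endpoint.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_probe(fqdn, path_prefix="/labs", query="up"):
    """Build (but do not send) a Prometheus instant-query request via the proxy."""
    params = urlencode({"query": query})
    url = f"http://{fqdn}{path_prefix}/api/v1/query?{params}"
    # urllib would derive Host from the URL anyway; setting it explicitly
    # makes the name the Apache vhost has to match visible.
    return Request(url, headers={"Host": fqdn})


req = build_probe("cloudmetrics1001.example.org")
print(req.full_url)
print(req.get_header("Host"))
```

Passing `req` to `urllib.request.urlopen` from a host that can reach the server would send it. A JSON body in the response suggests the vhost for the FQDN is proxying to Prometheus, while a Grafana page or an error suggests the request fell through to another vhost.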

Maintenance_bot removed a project: Patch-For-Review. Jan 15 2020, 10:10 PM

• JHedden mentioned this in T148669: Proxy calls on Labs Grafana get 403 while not logged in. Feb 18 2020, 2:39 PM
