Page MenuHomePhabricator

Split metricsinfra alertmanager to separate hosts from prometheus
Closed, ResolvedPublic

Description

Alertmanager will have its own set of services around it (irc/phab relays and metricsinfra-specific authorization proxies for example), it's easier than prometheus to set up in a HA configuration and the exact ways of scaling prometheus on metricsinfra are still unknown, so it makes sense (at least to me) to split alertmanager to live on a separate VM than where Prometheus itself lives.

Event Timeline

taavi triaged this task as High priority.Jul 8 2021, 9:22 AM
taavi created this task.
Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Change 703708 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] metricsinfra: Add HAProxy for distributing http traffic

https://gerrit.wikimedia.org/r/703708

Change 704522 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] metricsinfra: Remove alertmanager apache proxy

https://gerrit.wikimedia.org/r/704522

Change 703708 merged by Bstorm:

[operations/puppet@production] metricsinfra: Add HAProxy for distributing http traffic

https://gerrit.wikimedia.org/r/703708

Change 705014 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] metricsinfra: Add separate alertmanager support

https://gerrit.wikimedia.org/r/705014

Change 704522 merged by Bstorm:

[operations/puppet@production] metricsinfra: Remove alertmanager apache proxy

https://gerrit.wikimedia.org/r/704522

Change 705014 merged by Bstorm:

[operations/puppet@production] metricsinfra: Add separate alertmanager support

https://gerrit.wikimedia.org/r/705014

Change 705632 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] metricsinfra: remove alertmanager from prometheus role

https://gerrit.wikimedia.org/r/705632

Change 705632 merged by Bstorm:

[operations/puppet@production] metricsinfra: remove alertmanager from prometheus role

https://gerrit.wikimedia.org/r/705632