We need monitoring+alerting for orchestrator's health.
Description
Details
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
orchestrator: add monitoring for process and TCP port | operations/puppet | production | +37 -0 |
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T265990 orchestrator: Puppetize | |||
Resolved | Dzahn | T266338 orchestrator: Add service monitoring |
Event Timeline
Change 636067 had a related patch set uploaded (by Dzahn; owner: Dzahn):
[operations/puppet@production] orchestrator: add monitoring for process and TCP port
We don't have it in production, so putting this to low as we aren't on a hurry for this as of today
Change 636067 merged by Dzahn:
[operations/puppet@production] orchestrator: add monitoring for process and TCP port
New checks have been added to Icinga:
https://icinga.wikimedia.org/cgi-bin/icinga/status.cgi?search_string=orchestrator
But notifications for everything on this new host are disabled. Up to you when you want to enable them.
Thank you Daniel!
This looks good for now, so far we are going to keep notifications disabled on the host as we are doing many changes still, some of which involves restarts, breakages etc!
Thanks for helping out!
Closing this as it is done we just keep notifications disabled for now.
Thanks Daniel