Page MenuHomePhabricator

tbs: user-story 9: Create an alert on metricsinfra for harbor being down on toolsbeta
Open, Needs TriagePublic8 Estimated Story Points

Description

This can be achieved by reusing the blackbox probe, that is used in puppet with something like:

prometheus::blackbox::check::http { $static_domain:
    port                => 80,
    # this should always exist
    path                => '/admin/fingerprints/',
    ip_families         => ['ip4'],
    prometheus_instance => 'tools',
    team                => 'wmcs',
    severity            => 'warning',
}

Note that this should be attached to the harbor profile, and the path should be changed to something harbor will reply 2xx on when up.

Related Objects

Event Timeline

dcaro triaged this task as High priority.Dec 14 2022, 2:02 PM
dcaro created this task.
dcaro added a project: Toolforge Build Service.
dcaro removed the point value for this task.
dcaro raised the priority of this task from High to Needs Triage.Mon, Mar 6, 3:03 PM