
Document Icinga Migration Strategy and Communicate to SRE Teams
Open, Needs Triage, Public

Description

This task provides clear direction and documentation on the future of Icinga within our alerting infrastructure. As we transition to Prometheus + Alertmanager, engineers need to understand how to handle new alerts and what the long-term plan for Icinga entails. The work involves reviewing the Alerting Infrastructure Roadmap, creating a detailed migration plan, and communicating that plan to all SRE teams.

Problem:

  • Engineers currently lack clear guidance on how to handle new alerts and on the overall direction for Icinga as we transition to Prometheus + Alertmanager.
  • This uncertainty may lead to inconsistent alerting practices, knowledge gaps, and difficulty maintaining a streamlined alerting approach.

Goals:

  • Create comprehensive documentation outlining the Icinga migration plan and future state.
  • Communicate the Icinga strategy effectively to all SRE teams, ensuring a smooth transition.
  • Establish clear guidelines for handling new alerts during the transition period.
  • Define long-term goals and expectations for Icinga's role within the alerting infrastructure.