Page MenuHomePhabricator

Transition to Pyrra for SLO Visualization and Management
Open, MediumPublic

Description

Today we manage SLO dashboards using an in-house jsonnet template which is rendered and deployed to grafana using grafana-grizzly. Since establishing this process, a self contained SLO management tool Pyrra (https://pyrra.dev) has seen much active development and offers several benefits including improved (dedicated) SLO visualization, search, labeling, automation of recording rules, integrated multi burn alerting, and more.

This task initially served as a placeholder to explore this (hence the patch history) and is being expanded to serve as a tracking task for Pyrra deployment and integration.

High level checklist, in rough order:

Details

SubjectRepoBranchLines +/-
operations/grafana-grizzlymaster+8 -3
operations/puppetproduction+31 -0
operations/puppetproduction+59 -59
operations/puppetproduction+1 -1
operations/puppetproduction+49 -4
operations/dnsmaster+6 -1
operations/puppetproduction+36 -0
operations/puppetproduction+40 -0
operations/puppetproduction+12 -11
operations/puppetproduction+41 -0
operations/puppetproduction+38 -35
operations/puppetproduction+68 -60
operations/puppetproduction+16 -8
operations/puppetproduction+68 -0
operations/puppetproduction+150 -141
operations/puppetproduction+38 -1
operations/puppetproduction+2 -2
operations/puppetproduction+32 -0
operations/puppetproduction+3 -0
operations/puppetproduction+1 -1
operations/puppetproduction+36 -0
operations/puppetproduction+25 -0
operations/puppetproduction+3 -3
operations/puppetproduction+4 -3
operations/puppetproduction+2 -1
operations/puppetproduction+1 -1
operations/puppetproduction+7 -0
operations/dnsmaster+4 -0
operations/puppetproduction+15 -0
operations/puppetproduction+3 -2
operations/puppetproduction+5 -4
operations/puppetproduction+3 -1
operations/puppetproduction+1 -1
operations/puppetproduction+0 -3
operations/puppetproduction+1 -1
operations/puppetproduction+3 -0
operations/puppetproduction+10 -0
operations/puppetproduction+32 -0
operations/puppetproduction+71 -0
operations/debs/pyrramaster+227 -0
Show related patches Customize query in gerrit

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Change 945620 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: fix typo in apache config

https://gerrit.wikimedia.org/r/945620

Change 945620 merged by Herron:

[operations/puppet@production] pyrra: fix typo in apache config

https://gerrit.wikimedia.org/r/945620

Change 945623 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: remove apache rewrite config

https://gerrit.wikimedia.org/r/945623

Change 945623 merged by Herron:

[operations/puppet@production] pyrra: remove apache rewrite config

https://gerrit.wikimedia.org/r/945623

Change 945625 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra-filesystem: update prometheus folder arg name

https://gerrit.wikimedia.org/r/945625

Change 945625 merged by Herron:

[operations/puppet@production] pyrra-filesystem: update prometheus folder arg name

https://gerrit.wikimedia.org/r/945625

Change 945627 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra-filesystem: ensure config directory

https://gerrit.wikimedia.org/r/945627

Change 945627 merged by Herron:

[operations/puppet@production] pyrra-filesystem: ensure config directory

https://gerrit.wikimedia.org/r/945627

Change 945630 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra-filesystem: add package require for file resource

https://gerrit.wikimedia.org/r/945630

Change 945630 merged by Herron:

[operations/puppet@production] pyrra-filesystem: add package require for file resource

https://gerrit.wikimedia.org/r/945630

Change 956867 had a related patch set uploaded (by Filippo Giunchedi; author: Filippo Giunchedi):

[operations/puppet@production] titan: move pyrra off thanos role

https://gerrit.wikimedia.org/r/956867

Change 956867 merged by Filippo Giunchedi:

[operations/puppet@production] titan: move pyrra off thanos role

https://gerrit.wikimedia.org/r/956867

Change 961128 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: add trafficserver mapping

https://gerrit.wikimedia.org/r/961128

Change 961129 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] services: add pyrra conftool-data and service stub entry

https://gerrit.wikimedia.org/r/961129

Change 961130 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: use load balancing

https://gerrit.wikimedia.org/r/961130

Change 961131 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: add serveraliases and redirect to apache config

https://gerrit.wikimedia.org/r/961131

Change 961132 had a related patch set uploaded (by Herron; author: Herron):

[operations/dns@master] pyrra add service dns entries

https://gerrit.wikimedia.org/r/961132

Change 961133 had a related patch set uploaded (by Herron; author: Herron):

[operations/dns@master] pyrra: add public dns entries

https://gerrit.wikimedia.org/r/961133

Change 961128 merged by Herron:

[operations/puppet@production] pyrra: add trafficserver mapping

https://gerrit.wikimedia.org/r/961128

Change 961133 merged by Herron:

[operations/dns@master] pyrra: add public dns entries

https://gerrit.wikimedia.org/r/961133

Change 961131 merged by Herron:

[operations/puppet@production] pyrra: add serveraliases and redirect to apache config

https://gerrit.wikimedia.org/r/961131

Change 961862 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] thanos::httpd: include rewrite module

https://gerrit.wikimedia.org/r/961862

Change 961862 merged by Herron:

[operations/puppet@production] thanos::httpd: include rewrite module

https://gerrit.wikimedia.org/r/961862

Change 930628 merged by Herron:

[operations/puppet@production] thanos-rule: add pyrra filesystem operator output dir to search path

https://gerrit.wikimedia.org/r/930628

Mentioned in SAL (#wikimedia-operations) [2023-10-17T22:03:21Z] <herron> pyrra.wm.o upgraded to 0.7.1 T302995

Change 966906 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra::filesystem::config: add pyrra filesystem operator config manager

https://gerrit.wikimedia.org/r/966906

Change 966917 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: add prometheus external url

https://gerrit.wikimedia.org/r/966917

Change 966917 merged by Herron:

[operations/puppet@production] pyrra: add prometheus external url

https://gerrit.wikimedia.org/r/966917

Change 966920 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra::filesystem: correct config permissions

https://gerrit.wikimedia.org/r/966920

Change 966920 merged by Herron:

[operations/puppet@production] pyrra::filesystem: correct config permissions

https://gerrit.wikimedia.org/r/966920

Change 966909 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: add logstash requests slo

https://gerrit.wikimedia.org/r/966909

Change 966906 merged by Herron:

[operations/puppet@production] pyrra::filesystem::config: add pyrra filesystem operator config manager

https://gerrit.wikimedia.org/r/966906

Change 966909 merged by Herron:

[operations/puppet@production] pyrra: add logstash requests slo

https://gerrit.wikimedia.org/r/966909

Change 967273 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra-filesystem: enable generic recording rules

https://gerrit.wikimedia.org/r/967273

Change 967273 merged by Herron:

[operations/puppet@production] pyrra-filesystem: enable generic recording rules

https://gerrit.wikimedia.org/r/967273

herron renamed this task from Explore dedicated (non-grafana) SLO Visualization and Management to Explore Pyrra for SLO Visualization and Management.Oct 23 2023, 2:44 PM
herron updated the task description. (Show Details)

Change 967950 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: onboard varnish-requests as pilot SLO

https://gerrit.wikimedia.org/r/967950

Change 974148 had a related patch set uploaded (by Elukey; author: Elukey):

[operations/puppet@production] profile::thanos: add new istio recording rule

https://gerrit.wikimedia.org/r/974148

Change 974149 had a related patch set uploaded (by Elukey; author: Elukey):

[operations/puppet@production] profile::pyrra::filesystem: add Lift Wing pilot

https://gerrit.wikimedia.org/r/974149

Change 974148 merged by Elukey:

[operations/puppet@production] profile::thanos: add new istio recording rule

https://gerrit.wikimedia.org/r/974148

Change 974149 merged by Elukey:

[operations/puppet@production] profile::pyrra::filesystem: add Lift Wing pilot

https://gerrit.wikimedia.org/r/974149

Change 974496 had a related patch set uploaded (by Elukey; author: Elukey):

[operations/puppet@production] profile::pyrra::filesystem: improve/fix lift wing pilot

https://gerrit.wikimedia.org/r/974496

Change 974496 merged by Elukey:

[operations/puppet@production] profile::pyrra::filesystem: improve/fix lift wing pilot

https://gerrit.wikimedia.org/r/974496

Change 983950 had a related patch set uploaded (by Dwisehaupt; author: Dwisehaupt):

[operations/dns@master] Add dyna record for community-crm

https://gerrit.wikimedia.org/r/983950

Change 983951 had a related patch set uploaded (by Dwisehaupt; author: Dwisehaupt):

[operations/puppet@production] Set the cdn to pass requests for community-crm

https://gerrit.wikimedia.org/r/983951

Change 967950 merged by Herron:

[operations/puppet@production] pyrra: onboard varnish-requests as pilot SLO

https://gerrit.wikimedia.org/r/967950

herron renamed this task from Explore Pyrra for SLO Visualization and Management to Transition to Pyrra for SLO Visualization and Management.May 6 2024, 3:06 PM
herron updated the task description. (Show Details)

Change #1028524 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: separate slo definitions from filesystem class

https://gerrit.wikimedia.org/r/1028524

Change #1028524 merged by Herron:

[operations/puppet@production] pyrra: separate slo definitions from filesystem class

https://gerrit.wikimedia.org/r/1028524

Change #1028555 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: onboard etcd request/latency SLOs

https://gerrit.wikimedia.org/r/1028555

herron updated the task description. (Show Details)

Change #1028555 merged by Herron:

[operations/puppet@production] pyrra: onboard etcd request/latency SLOs

https://gerrit.wikimedia.org/r/1028555

Change #1028854 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: varnish: workaround site grouping limitation

https://gerrit.wikimedia.org/r/1028854

Change #1028854 merged by Herron:

[operations/puppet@production] pyrra: varnish: workaround site grouping limitation

https://gerrit.wikimedia.org/r/1028854

Change #1028864 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: etcd: add generic rules workaround

https://gerrit.wikimedia.org/r/1028864

Change #1028864 merged by Herron:

[operations/puppet@production] pyrra: etcd: add generic rules workaround

https://gerrit.wikimedia.org/r/1028864

Change #1028881 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: logstash: add generic rules workaround

https://gerrit.wikimedia.org/r/1028881

Change #1028881 merged by Herron:

[operations/puppet@production] pyrra: logstash: add generic rules workaround

https://gerrit.wikimedia.org/r/1028881

Change #1029634 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: onboard haproxy slo from grizzly

https://gerrit.wikimedia.org/r/1029634

Change #1029634 merged by Herron:

[operations/puppet@production] pyrra: onboard haproxy slo from grizzly

https://gerrit.wikimedia.org/r/1029634

Change #1029654 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: varnish: add cluster

https://gerrit.wikimedia.org/r/1029654

Change #1029654 merged by Herron:

[operations/puppet@production] pyrra: varnish: add cluster

https://gerrit.wikimedia.org/r/1029654

Change #1030227 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: trafficserver: onboard slo from grizzly

https://gerrit.wikimedia.org/r/1030227

Change #1030227 merged by Herron:

[operations/puppet@production] pyrra: trafficserver: onboard slo from grizzly

https://gerrit.wikimedia.org/r/1030227

Change #1031527 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: linkrecommendation: onboard slo from grizzly

https://gerrit.wikimedia.org/r/1031527

Change #1031527 merged by Herron:

[operations/puppet@production] pyrra: linkrecommendation: onboard slo from grizzly

https://gerrit.wikimedia.org/r/1031527

Change #961132 abandoned by Herron:

[operations/dns@master] pyrra add service dns entries

Reason:

ended up piggybacking on thanos-web for this

https://gerrit.wikimedia.org/r/961132

Change #961129 abandoned by Herron:

[operations/puppet@production] services: add pyrra conftool-data and service stub entry

Reason:

ended up piggybacking on thanos-web for this

https://gerrit.wikimedia.org/r/961129

Change #961130 abandoned by Herron:

[operations/puppet@production] pyrra: use load balancing

Reason:

ended up piggybacking on thanos-web for this

https://gerrit.wikimedia.org/r/961130

Change #1051439 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: add liftwing SLOs

https://gerrit.wikimedia.org/r/1051439

Change #1051439 merged by Herron:

[operations/puppet@production] pyrra: add liftwing SLOs

https://gerrit.wikimedia.org/r/1051439

Change #1054617 had a related patch set uploaded (by Herron; author: Herron):

[operations/puppet@production] pyrra: onboard wdqs request SLO

https://gerrit.wikimedia.org/r/1054617

Change #1077966 had a related patch set uploaded (by Herron; author: Herron):

[operations/grafana-grizzly@master] add links to SLOs migrated to pyrra

https://gerrit.wikimedia.org/r/1077966

Change #1077966 merged by Herron:

[operations/grafana-grizzly@master] add links to SLOs migrated to pyrra

https://gerrit.wikimedia.org/r/1077966