
lmata (Leo Mata)
Disabled

Projects (12)

User Details

User Since: May 14 2020, 7:26 PM (291 w, 4 d)
Roles: Disabled
IRC Nick: lmata
LDAP User: LMata
MediaWiki User: LMata (WMF)

Recent Activity

Jul 30 2025

lmata updated subscribers of T398229: FY25-26 SDS2.1.3 Reliability - Production Monitoring.
Jul 30 2025, 8:43 PM · OKR-Work, Test Kitchen (Experiment Platform Sprint 16), Epic
lmata edited projects for T400443: Allow wider Alertmanager API RO access, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability.
Jul 30 2025, 2:09 PM · SRE Observability (FY2025/2026-Q1)

Jul 23 2025

lmata moved T399807: Allow team customization for service::catalog probes from Inbox to Prioritized on the Observability-Alerting board.
Jul 23 2025, 2:05 PM · Observability-Alerting
lmata triaged T399807: Allow team customization for service::catalog probes as Medium priority.
Jul 23 2025, 2:05 PM · Observability-Alerting

Jul 16 2025

lmata moved T399195: Update logging and monitoring for multiple session storage backends from Inbox to Radar on the observability board.
Jul 16 2025, 2:05 PM · Patch-For-Review, observability, MediaWiki-Core-AuthManager, MediaWiki-Platform-Team, OKR-Work

Jul 9 2025

lmata moved T398605: Prometheus puppettization has a very large directory from Inbox to Radar on the observability board.
Jul 9 2025, 2:03 PM · Observability-Metrics, observability
lmata triaged T398605: Prometheus puppettization has a very large directory as Medium priority.
Jul 9 2025, 2:03 PM · Observability-Metrics, observability

Jul 2 2025

lmata updated the task description for T398302: On-call batphone escalation configuration holidays FY2025/26.
Jul 2 2025, 10:38 PM · SRE Observability (FY2025/2026-Q1)
lmata moved T391516: https://performance.wikimedia.org/php-profiling/ leads to 404 for all listed sources from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Regression, observability, Arc-Lamp, WikimediaDebug
lmata moved T397264: create a new place for prometheus/alertmanager checks not tied to physical machines from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Alerting, collaboration-services, SRE
lmata moved T305223: Clean up stale Prometheus target and rules files from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · Patch-For-Review, Observability-Metrics, SRE Observability (FY2025/2026-Q1)
lmata moved T228380: Tech debt: sunsetting of Graphite from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · MW-1.46-notes (1.46.0-wmf.4; 2025-11-25), SRE Observability (FY2025/2026-Q1), MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Patch-For-Review, Technical-Debt, Observability-Metrics
lmata moved T393630: Cookbook downtiming does not work, continues anyway from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1)
lmata moved T395032: Cookbook sre.hosts.remove_downtime does not remove silences from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Alerting, SRE-tools
lmata moved T397427: librenms-syslog leaks memory from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Infrastructure-Foundations, SRE
lmata moved T396862: Improve titan hosts stateless-ness from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · Observability-Metrics
lmata moved T397756: Kafka-logging -> Bookworm from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · Observability-Logging, SRE Observability (FY2025/2026-Q1)
lmata moved T397757: Kafkamon -> Bookworm from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · Observability-Logging, SRE Observability (FY2025/2026-Q1)
lmata moved T392886: Revisit default Istio histogram buckets from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Patch-For-Review, Observability-Metrics
lmata moved T372242: Alert on unscrapable pods from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Alerting, serviceops, Kubernetes
lmata moved T321808: Port all Icinga checks to Prometheus/Alertmanager from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Alerting
lmata moved T372845: Migrate all o11y services to nftables from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T388506: Implement a less noisy way to remove nrpe checks (without UNKNOWN spam) from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Alerting
lmata moved T390196: Deploy and document a method to dump logs from logstash from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Logging
lmata moved T393894: New version of Grafana makes it not possible to remove option in long list of values from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Grafana
lmata moved T394069: Rendering Graph's as images times out on Grafana 11 from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1)
lmata moved T395441: Port all Icinga checks to Prometheus/Alertmanager: preparation from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T395445: Migrate away from Icinga service checks that can trigger pages (non-db only) from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T395443: Automate Snapshot Generation for Icinga to Prometheus/Alermanager Migration from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · Patch-For-Review, SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T395442: Setup reliable migration dashboards from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T395446: Evaluate which solution we could adopt as a drop-in replacement for NRPE (and start prototyping) from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T396626: Hardware retirement Graphite Infrastructure (ETA June 2026) from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Technical-Debt, Observability-Metrics
lmata moved T395448: Discuss about "host down" semantics from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T395447: Prototype / experiment with moving raid checks to alertmanager from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T395553: ircecho (icinga-wm) doesn't automatically restart if not connected from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Alerting, Sustainability (Incident Followup)
lmata moved T395449: Reimage cookbook icinga logic review from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics
lmata moved T398073: Ensure DPE SRE can receive alerts for applications hosted in wikikube from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, SRE Observability (FY2025/2026-Q1), serviceops
lmata moved T395916: Reduce Pyrra's default window from 12w to 4w from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics, SRE-SLO
lmata moved T398311: Add links in the Pyrra rolling dashboards to point to their calendar ones in Grafana from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), observability, SRE-SLO
lmata moved T398444: More frequent Puppet runs on the alert hosts? from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · Infrastructure-Foundations, SRE-tools, SRE Observability (FY2025/2026-Q1)
lmata moved T398313: Add a banner to slo.wikimedia.org explaining rolling vs calendar views from Inbox to Up next on the SRE Observability (FY2025/2026-Q1) board.
Jul 2 2025, 2:12 PM · SRE Observability (FY2025/2026-Q1), observability, SRE-SLO
lmata edited projects for T398073: Ensure DPE SRE can receive alerts for applications hosted in wikikube, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability.
Jul 2 2025, 2:10 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), Essential-Work, SRE Observability (FY2025/2026-Q1), serviceops
lmata edited projects for T398444: More frequent Puppet runs on the alert hosts?, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability.
Jul 2 2025, 2:07 PM · Infrastructure-Foundations, SRE-tools, SRE Observability (FY2025/2026-Q1)
lmata added a project to T398311: Add links in the Pyrra rolling dashboards to point to their calendar ones in Grafana: SRE Observability (FY2025/2026-Q1).
Jul 2 2025, 2:06 PM · SRE Observability (FY2025/2026-Q1), observability, SRE-SLO
lmata moved T398313: Add a banner to slo.wikimedia.org explaining rolling vs calendar views from Inbox to Radar on the observability board.
Jul 2 2025, 2:05 PM · SRE Observability (FY2025/2026-Q1), observability, SRE-SLO
lmata added a project to T398313: Add a banner to slo.wikimedia.org explaining rolling vs calendar views: SRE Observability (FY2025/2026-Q1).
Jul 2 2025, 2:05 PM · SRE Observability (FY2025/2026-Q1), observability, SRE-SLO
lmata archived SRE Observability (FY2024/2025-Q4).
Jul 2 2025, 1:53 PM
lmata moved T391677: Audit dashboards using histogram_quantile on mediawiki_WikimediaEvents_editResponseTime from Radar to Done on the SRE Observability (FY2024/2025-Q4) board.
Jul 2 2025, 1:53 PM · MediaWiki-Platform-Team, MW-1.44-notes (1.44.0-wmf.27; 2025-04-29), SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T387350: liftwing SLO performance issues from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jul 2 2025, 1:53 PM · SRE Observability (FY2024/2025-Q4), SRE-SLO, Observability-Metrics
lmata edited projects for T395916: Reduce Pyrra's default window from 12w to 4w, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability (FY2024/2025-Q4).
Jul 2 2025, 1:52 PM · SRE Observability (FY2025/2026-Q1), Observability-Metrics, SRE-SLO

Jul 1 2025

lmata updated the task description for T398302: On-call batphone escalation configuration holidays FY2025/26.
Jul 1 2025, 11:44 AM · SRE Observability (FY2025/2026-Q1)
lmata created T398302: On-call batphone escalation configuration holidays FY2025/26.
Jul 1 2025, 11:43 AM · SRE Observability (FY2025/2026-Q1)

Jun 30 2025

lmata moved T393738: The stunnel4 service requires a manual restart after disabling/enabling sync in Grafana hosts from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:07 PM · SRE Observability (FY2024/2025-Q4)
lmata moved T394045: When selecting a DC some Grafana panels show instances for other DC from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:07 PM · SRE Observability (FY2024/2025-Q4)
lmata moved T394319: Move thanos cache out of process from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:07 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T394318: Revisit thanos queries concurrency and limits from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:07 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T395098: Upgrade to Grafana 12.0.1 from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4)
lmata moved T395130: Migrate prometheus7001 to prometheus7002 from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T392488: kafka-logging2005 is down since six days from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE, SRE Observability (FY2024/2025-Q4), DC-Ops, ops-codfw
lmata moved T393439: Graphite data sources broken on grafana-next from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · Grafana, SRE Observability (FY2024/2025-Q4)
lmata moved T391661: Weekly indices are force-merged by curator every day for a week from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Logging
lmata moved T390194: Add read-only users capability to logs-api.svc from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4)
lmata moved T385693: thanos-query overload due to heavy queries from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T383966: Upgrade Thanos to 0.38.0 from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T381665: module to define custom Prometheus alerts directly in Puppet from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Alerting, Patch-For-Review
lmata moved T383232: Move k8s Prometheus instances to new Prometheus hw in eqiad/codfw from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T372457: Remove librenms -> graphite integration, replace with gnmi from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics, Cloud-VPS, cloud-services-team
lmata moved T370772: Prometheus eqiad/codfw hw expansion architecture options from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T384841: Upgrade to Grafana 11 from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4)
lmata moved T384831: Repeated library panels in Grafana showing only after refresh, not on first load from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), serviceops, Observability-Metrics
lmata moved T395688: Icinga event handler 'raid_handler' failed to create a Phabricator task from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:06 PM · SRE Observability (FY2024/2025-Q4), Observability-Alerting
lmata moved T359271: (Analytics?) Migrate MediaWiki.TemplateData to statslib from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:05 PM · SRE Observability (FY2024/2025-Q4), Editing-team, VisualEditor, Observability-Metrics
lmata moved T397967: scap logs are being dead-lettered from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:05 PM · SRE Observability (FY2024/2025-Q4), Observability-Logging
lmata moved T395819: Create a tool to validate ECS logs from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:05 PM · SRE Observability (FY2024/2025-Q4), Observability-Logging
lmata moved T391333: Revisit default envoy histogram buckets from In progress to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:05 PM · Patch-For-Review, envoy, serviceops, SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T385054: Make sure Grafana-based alerts based on Graphite dashboards/panels are migrated too from Up next to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:05 PM · SRE Observability (FY2024/2025-Q4), Technical-Debt, Observability-Metrics
lmata moved T389357: Audit dashboards using histogram_quantile on big envoy metrics and move to recording rules from In progress to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:05 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata moved T392994: Move Thanos trace sampling to native and off otlp coll from In progress to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 30 2025, 4:05 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics
lmata edited projects for T353912: Observability Bookworm upgrades, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability (FY2024/2025-Q4).
Jun 30 2025, 4:05 PM · SRE Observability (FY2025/2026-Q1), observability, Patch-For-Review
lmata edited projects for T343020: Converting MediaWiki Metrics to StatsLib, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability (FY2024/2025-Q4).
Jun 30 2025, 4:05 PM · SRE Observability (FY2025/2026-Q1), Essential-Work, Editing-team (Kanban Board), MW-1.44-notes (1.44.0-wmf.28; 2025-05-06), Patch-For-Review, Observability-Metrics
lmata edited projects for T288622: All Prometheus based alerts move from Icinga to alert manager exclusively, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability (FY2024/2025-Q4).
Jun 30 2025, 4:05 PM · Patch-For-Review, SRE Observability (FY2025/2026-Q1), Observability-Alerting
lmata edited projects for T350592: EPIC: migrate in use metrics and dashboards to statslib, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability (FY2024/2025-Q4).
Jun 30 2025, 4:04 PM · SRE Observability (FY2025/2026-Q1), MW-1.43-notes (1.43.0-wmf.21; 2024-09-03), Epic, MW-1.42-notes (1.42.0-wmf.15; 2024-01-23), MediaWiki-Platform-Team (Radar), Observability-Metrics

Jun 25 2025

lmata edited projects for T397427: librenms-syslog leaks memory, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability.
Jun 25 2025, 2:09 PM · SRE Observability (FY2025/2026-Q1), Infrastructure-Foundations, SRE
lmata moved T391516: https://performance.wikimedia.org/php-profiling/ leads to 404 for all listed sources from Inbox to Radar on the observability board.
Jun 25 2025, 2:07 PM · SRE Observability (FY2025/2026-Q1), Regression, observability, Arc-Lamp, WikimediaDebug
lmata edited projects for T397264: create a new place for prometheus/alertmanager checks not tied to physical machines, added: Observability-Alerting, SRE Observability (FY2025/2026-Q1); removed observability.
Jun 25 2025, 2:05 PM · SRE Observability (FY2025/2026-Q1), Observability-Alerting, collaboration-services, SRE

Jun 23 2025

lmata moved T368786: Add support for nesting to StatsFactory->getTiming start/stop feature from Inbox to Done on the SRE Observability (FY2024/2025-Q4) board.
Jun 23 2025, 5:19 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics, MediaWiki-Platform-Team, MW-1.45-notes (1.45.0-wmf.7; 2025-06-24), MediaWiki-libs-Stats
lmata added a project to T368786: Add support for nesting to StatsFactory->getTiming start/stop feature: SRE Observability (FY2024/2025-Q4).
Jun 23 2025, 5:19 PM · SRE Observability (FY2024/2025-Q4), Observability-Metrics, MediaWiki-Platform-Team, MW-1.45-notes (1.45.0-wmf.7; 2025-06-24), MediaWiki-libs-Stats
lmata closed T369122: On-call batphone escalation configuration holidays FY2024/25 as Resolved.
Jun 23 2025, 3:04 PM · SRE Observability (FY2024/2025-Q4)

Jun 19 2025

lmata moved T363753: Only select o11y-owned datasources on the Grafana Datasource utilization dashboard from Inbox to Prioritized on the Observability-Metrics board.
Jun 19 2025, 3:20 PM · SRE Observability (FY2024/2025-Q3), Observability-Metrics
lmata moved T363753: Only select o11y-owned datasources on the Grafana Datasource utilization dashboard from Inbox to Done on the SRE Observability (FY2024/2025-Q3) board.
Jun 19 2025, 3:20 PM · SRE Observability (FY2024/2025-Q3), Observability-Metrics
lmata closed T363753: Only select o11y-owned datasources on the Grafana Datasource utilization dashboard, a subtask of T350591: Audit legacy mediawiki stats used in production dashboards, as Resolved.
Jun 19 2025, 3:20 PM · SRE Observability (FY2023/2024-Q3), Patch-For-Review, Observability-Metrics
lmata closed T363753: Only select o11y-owned datasources on the Grafana Datasource utilization dashboard as Resolved.

Closing, this is either resolved or no longer necessary per the current status of T228380: Tech debt: sunsetting of Graphite

Jun 19 2025, 3:20 PM · SRE Observability (FY2024/2025-Q3), Observability-Metrics
lmata updated the task description for T369122: On-call batphone escalation configuration holidays FY2024/25.
Jun 19 2025, 5:51 AM · SRE Observability (FY2024/2025-Q4)

Jun 18 2025

lmata closed T379156: Change/fix real user performance alert to only use Prometheus, a subtask of T228380: Tech debt: sunsetting of Graphite, as Resolved.
Jun 18 2025, 2:33 PM · MW-1.46-notes (1.46.0-wmf.4; 2025-11-25), SRE Observability (FY2025/2026-Q1), MW-1.45-notes (1.45.0-wmf.1; 2025-05-13), Patch-For-Review, Technical-Debt, Observability-Metrics
lmata closed T379156: Change/fix real user performance alert to only use Prometheus, a subtask of T384459: Web Performance responsibilities, as Resolved.
Jun 18 2025, 2:33 PM · PM, Test-Platform (Radar)
lmata closed T379156: Change/fix real user performance alert to only use Prometheus as Resolved.

Hi! Discussed this task with the team today, they've shared that these have been migrated. I'm resolving this task for now. Please re-open if this is not the case and there is still work pending.

Jun 18 2025, 2:33 PM · Technical-Debt, Observability-Metrics
lmata edited projects for T305223: Clean up stale Prometheus target and rules files, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability.
Jun 18 2025, 2:20 PM · Patch-For-Review, Observability-Metrics, SRE Observability (FY2025/2026-Q1)
lmata edited projects for T393630: Cookbook downtiming does not work, continues anyway, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability.
Jun 18 2025, 2:17 PM · SRE Observability (FY2025/2026-Q1)
lmata edited projects for T396862: Improve titan hosts stateless-ness, added: SRE Observability (FY2025/2026-Q1); removed SRE Observability.
Jun 18 2025, 2:13 PM · Observability-Metrics
lmata moved T382181: Investigate adding toolforge projects to Prometheus from Inbox to Radar on the observability board.
Jun 18 2025, 2:12 PM · observability