Page MenuHomePhabricator

SystemdUnitFailed - envoyproxy on people1005
Closed, ResolvedPublic

Description

Common information

  • alertname: SystemdUnitFailed
  • instance: people1005:9100
  • name: wmf_auto_restart_envoyproxy.service
  • prometheus: ops
  • severity: critical
  • site: eqiad
  • source: prometheus
  • team: collaboration-services

Firing alerts


Event Timeline

Dzahn renamed this task from SystemdUnitFailed to SystemdUnitFailed - envoyproxy on people1005.Aug 26 2025, 5:50 PM
Dzahn claimed this task.
Dzahn subscribed.

This server is on trixie and we don't have the envoyproxy package yet. But we will get it in T402584.

extended downtime

Mentioned in SAL (#wikimedia-operations) [2025-09-01T07:40:12Z] <arnaudb@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 12:00:00 on people1005.eqiad.wmnet with reason: WIP T402953#11120672