Page MenuHomePhabricator

SystemdUnitFailed - wmf_auto_restart_vrts_rsync.service on vrts1003
Closed, ResolvedPublic

Description

Common information

  • alertname: SystemdUnitFailed
  • instance: vrts1003:9100
  • name: wmf_auto_restart_vrts_rsync.service
  • prometheus: ops
  • severity: critical
  • site: eqiad
  • source: prometheus
  • team: collaboration-services

Firing alerts


Event Timeline

Dzahn renamed this task from SystemdUnitFailed to SystemdUnitFailed - wmf_auto_restart_vrts_rsync.service on vrts1003.Feb 3 2026, 8:22 PM
Dzahn subscribed.

Change #1236364 had a related patch set uploaded (by Dzahn; author: Dzahn):

[operations/puppet@production] VRTS: fix service name for profile::auto_restarts::service for rsync

https://gerrit.wikimedia.org/r/1236364

Change #1236364 merged by Dzahn:

[operations/puppet@production] VRTS: fix service name for profile::auto_restarts::service for rsync

https://gerrit.wikimedia.org/r/1236364

Mentioned in SAL (#wikimedia-operations) [2026-02-03T23:23:29Z] <mutante> vrts1003 - fix systemd state: sed -i 's/vrts_rsync/rsync/' /lib/systemd/system/wmf_auto_restart_vrts_rsync.service ; systemctl daemon-reload - T416380 T135991

https://gerrit.wikimedia.org/r/1236364

but after that also:

root@vrts1003:~# sed -i 's/vrts_rsync/rsync/' /lib/systemd/system/wmf_auto_restart_vrts_rsync.service

root@vrts1003:~# systemctl daemon-reload

then:

root@vrts1003:/home/dzahn# systemctl list-units --state=failed
  UNIT LOAD ACTIVE SUB DESCRIPTION
0 loaded units listed.
23:28 < jinxer-wm> RESOLVED: SystemdUnitFailed: wmf_auto_restart_vrts_rsync.service on vrts1003:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state -