Page MenuHomePhabricator

SystemdUnitFailed - lists1004 - wmf_auto_restart_exim4
Closed, ResolvedPublic

Description

Common information

  • alertname: SystemdUnitFailed
  • instance: lists1004:9100
  • name: wmf_auto_restart_exim4.service
  • prometheus: ops
  • severity: critical
  • site: eqiad
  • source: prometheus
  • team: collaboration-services

Firing alerts


Related Objects

Event Timeline

LSobanski renamed this task from SystemdUnitFailed to SystemdUnitFailed - lists1004 - wmf_auto_restart_exim4.Jul 2 2024, 12:26 PM

Mentioned in SAL (#wikimedia-operations) [2024-07-02T17:06:24Z] <mutante> lists1004 - sudo systemctl start wmf_auto_restart_exim4 (T369017)

was:

Jul 02 06:14:02 lists1004 wmf-auto-restart[124410]: WARNING: 2024-07-02 06:14:02,464 : Service exim4 uses a legacy sysvinit script
Jul 02 06:14:02 lists1004 wmf-auto-restart[124410]: WARNING: 2024-07-02 06:14:02,464 : Consider using a systemd unit instead
Jul 02 06:14:02 lists1004 wmf-auto-restart[124410]: ERROR: 2024-07-02 06:14:02,584 : Failed to restart service exim4:
Jul 02 06:14:02 lists1004 wmf-auto-restart[124410]: ERROR: 2024-07-02 06:14:02,585 : b'Job for exim4.service failed.\nSee "systemctl status exim4.service" and "journalctl -xeu exim4.ser>
Jul 02 06:14:02 lists1004 systemd[1]: wmf_auto_restart_exim4.service: Main process exited, code=exited, status=1/FAILURE

but resolved:

[lists1004:~] $ sudo systemctl status wmf_auto_restart_exim4
○ wmf_auto_restart_exim4.service - Auto restart job: exim4
     Loaded: loaded (/lib/systemd/system/wmf_auto_restart_exim4.service; static)
     Active: inactive (dead) since Tue 2024-07-02 17:05:42 UTC; 7s ago
TriggeredBy: ● wmf_auto_restart_exim4.timer
       Docs: https://wikitech.wikimedia.org/wiki/Monitoring/systemd_unit_state
    Process: 1594672 ExecStart=/usr/local/sbin/wmf-auto-restart -s exim4 (code=exited, status=0/SUCCESS)
   Main PID: 1594672 (code=exited, status=0/SUCCESS)

I can tell the issue is no longer an issue. But I don't know yet what actually caused:

2024-07-02T06:14:02.583273+00:00 lists1004 systemd[1]: Failed to start exim4.service - LSB: exim Mail Transport Agent.

Do you know?

This seems to have been a blip that hasn't reoccurred.