DegradedArray email alerts for aqs1013 and aqs1014 are firing since April 18
Closed, Resolved, Public

Description

Hello, both hosts have been sending degraded array emails every day since April 18. This generates a large number of non-actionable email alerts to the root@ email address, cluttering SREs' inboxes.

As of now, each host has sent 131 emails, for a total of 262. In addition, those emails trigger automated responses from people who are OOO, on sabbatical, etc.

Could you please take a look at those hosts and either fix the issue or turn them off if there are no plans to work on them? Either way, the alerts would stop.

Thanks in advance!

Event Timeline

Based on /etc/wikimedia/contacts.yaml, these hosts are owned by Data Persistence.

As such, I'm removing the Data Platform SRE tags and unassigning myself from this task.

Eevans claimed this task.

aqs1013 has been decommissioned (T379026), and aqs1014 has been fixed; closing.