Page MenuHomePhabricator

AQS Cassandra cluster: host/service failures should notify Data Persistence
Closed, ResolvedPublic

Description

Notifications for the AQS Cassandra cluster are being sent to Data Platform, instead of Data Persistence.

Additionally, notifications for the RESTBase & sessionstore clusters are using an old team-services contact, which goes to me (@Eevans) —due to historical reasons— but would be better moved to a Data Persistence team contact.


See also: T361603: aqs2001.codfw.wmnet down

Related Objects

Event Timeline

Eevans triaged this task as High priority.Apr 2 2024, 8:47 PM

Change #1024611 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/puppet@production] Update the ownership of the aqs cassandra cluster

https://gerrit.wikimedia.org/r/1024611

Change #1024611 merged by Btullis:

[operations/puppet@production] Update the ownership of the aqs cassandra cluster

https://gerrit.wikimedia.org/r/1024611

I think that this is done now. I had a quick glance at the alerts repo, but I didn't see anything that needed changing. Feel free to reopen if I have missed anything.