Per today's Wikimedia-Search IRC conversation with @BCornwall, we inadvertently triggered an LVS alert for high network bandwidth by running an Elastic snapshot/restore (see Grafana). Creating this ticket to:
- Discuss ways to reduce strain on external services (LVS, Swift) during snapshot operations with owning teams.
- Implement changes, if necessary.
Updated task description above to reflect IRC discussions re: alert detuning (Traffic/Infrastructure Foundations) vs. outbound traffic rate-limiting from ES hosts (Data Platform SRE).