Improve monitoring knowledge for Elasticsearch garbage collection
Closed, Resolved (Public)

Description

Per T323646, we are still seeing a lot of garbage collection-related alerts.

Creating this ticket to improve Prometheus labeling and update our docs with more details about our current GC monitoring and alerts.

Event Timeline

Change 864829 had a related patch set uploaded (by Bking; author: Ebernhardson):

[operations/puppet@production] prom: Add elasticsearch cluster name to exported latency metrics

https://gerrit.wikimedia.org/r/864829

Change 864829 merged by Bking:

[operations/puppet@production] prom: Add elasticsearch cluster name to exported latency metrics

https://gerrit.wikimedia.org/r/864829

Per @Gehel's suggestion, we may want to get back in touch with the jclarity.com folks (see T178271).

bking claimed this task.
bking moved this task from Incoming to Needs review on the Discovery-Search (Current work) board.

Marking resolved per triage meeting. Will open a separate ticket for the jclarity/JVM tuning discussion.