The monitoring for Elasticsearch doesn't seem to notice when an OOM has happened on a node. I have very seldom seen a node recover on its own from an OOM, so it seems like something worth alerting on.
Description
Details
Subject | Repo | Branch | Lines +/-
---|---|---|---
elasticsearch: exit the JVM on OutOfMemoryError | operations/puppet | production | +3 -0
Related Objects
Event Timeline
I think I remember there being a JVM command-line flag that lets you install an OOM handler. I have vague memories of using it to send alerts in a long-forgotten past as a Java Shop Administrator™.
Change 487787 had a related patch set uploaded (by Gehel; owner: Gehel):
[operations/puppet@production] elasticsearch: exit the JVM on OutOfMemoryError
Instead of monitoring for this specific error, let's just configure the JVM to exit on OutOfMemoryError so that the node gets restarted rather than lingering in a broken state.
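For illustration only, here is a minimal sketch of how one might check that a running JVM was actually started with such a flag. It assumes the HotSpot options `-XX:+ExitOnOutOfMemoryError` (exit on the first OutOfMemoryError) and `-XX:OnOutOfMemoryError=<command>` (run a command on OutOfMemoryError); the thread does not say which option the patch uses, and the class and method names below are hypothetical.

```java
import java.lang.management.ManagementFactory;
import java.util.List;

// Hypothetical helper, not part of the patch: inspects JVM startup arguments
// for the OOM-related HotSpot flags assumed in the lead-in.
public class OomFlagCheck {
    // Assumed flag: makes the JVM exit on the first OutOfMemoryError.
    static final String EXIT_FLAG = "-XX:+ExitOnOutOfMemoryError";
    // Assumed flag prefix: runs a user-supplied command on OutOfMemoryError.
    static final String HANDLER_PREFIX = "-XX:OnOutOfMemoryError=";

    // True if the exit-on-OOM flag appears in the argument list.
    // Simplification: does not account for a later -XX:-ExitOnOutOfMemoryError.
    static boolean exitsOnOom(List<String> jvmArgs) {
        return jvmArgs.contains(EXIT_FLAG);
    }

    // True if some argument configures an OOM handler command.
    static boolean hasOomHandler(List<String> jvmArgs) {
        return jvmArgs.stream().anyMatch(arg -> arg.startsWith(HANDLER_PREFIX));
    }

    public static void main(String[] args) {
        // Arguments the current JVM was started with (excluding main-class args).
        List<String> jvmArgs = ManagementFactory.getRuntimeMXBean().getInputArguments();
        System.out.println("exits on OOM: " + exitsOnOom(jvmArgs));
        System.out.println("OOM handler command set: " + hasOomHandler(jvmArgs));
    }
}
```

Running it as `java -XX:+ExitOnOutOfMemoryError OomFlagCheck` should report that the exit flag is present. Exiting only stops the process; bringing the node back still depends on whatever supervises the service.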
Change 487787 merged by Gehel:
[operations/puppet@production] elasticsearch: exit the JVM on OutOfMemoryError