Both mw-content-history-reconcile-enrich and mw-content-history-reconcile-enrich-next JobManagers on dse-k8s-eqiad are experiencing frequent, unexpected restarts.
This is occurring despite the applications being mostly idle. I haven't observed anything suspicious in the pod logs or the Flink JobManager UI.
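Grafana shows that pod memory usage increases over time until the container is OOM-killed. However, the Flink JobManager UI reports a stable JVM heap allocation of 256MB, which suggests the growth is happening outside the JVM heap (for example in metaspace, direct memory, or native allocations).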
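
To put numbers on the gap between the two views, a rough calculation along these lines might help. This is only a sketch: the `Sample` type, the helper names, and every value other than the 256MB heap figure are hypothetical placeholders, not measurements from these pods. It assumes pod memory samples have been exported from Grafana by hand and that growth is roughly linear.

```python
from dataclasses import dataclass
from typing import List, Optional

MIB = 1024 * 1024


@dataclass
class Sample:
    """One pod memory reading (hypothetical export from the Grafana panel)."""
    timestamp_s: float    # seconds since an arbitrary start
    container_bytes: int  # pod memory usage at that time


def non_heap_bytes(container_bytes: int, heap_bytes: int) -> int:
    """Memory used by the container beyond the JVM heap."""
    return container_bytes - heap_bytes


def growth_rate_bytes_per_s(samples: List[Sample]) -> float:
    """Average growth rate between the first and last sample."""
    first, last = samples[0], samples[-1]
    elapsed = last.timestamp_s - first.timestamp_s
    if elapsed <= 0:
        raise ValueError("samples must span a positive time interval")
    return (last.container_bytes - first.container_bytes) / elapsed


def seconds_until_limit(samples: List[Sample], limit_bytes: int) -> Optional[float]:
    """Estimated time until the container limit is reached, assuming linear growth.

    Returns None when memory is not growing.
    """
    rate = growth_rate_bytes_per_s(samples)
    if rate <= 0:
        return None
    remaining = limit_bytes - samples[-1].container_bytes
    return max(0.0, remaining / rate)


if __name__ == "__main__":
    heap = 256 * MIB  # heap size reported by the Flink JobManager UI
    # Placeholder readings and limit, for illustration only.
    samples = [Sample(0, 600 * MIB), Sample(3600, 900 * MIB)]
    limit = 1024 * MIB

    print("non-heap MiB:", non_heap_bytes(samples[-1].container_bytes, heap) / MIB)
    print("seconds until limit:", seconds_until_limit(samples, limit))
```

If the non-heap figure keeps growing while the heap stays flat, that would point towards off-heap or native memory rather than a heap leak.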
Some context in this Slack thread.