Thumbor currently runs with MemoryLimit=512M and hits that limit frequently, with 116 OOM kills logged in under three hours:
thumbor1001:~$ sudo head -2 /var/log/syslog
Sep 14 06:25:04 thumbor1001 rsyslogd: [origin software="rsyslogd" swVersion="8.4.2" x-pid="1131" x-info="http://www.rsyslog.com"] rsyslogd was HUPed
Sep 14 06:25:42 thumbor1001 prometheus-node-exporter[96676]: time="2016-09-14T06:25:42Z" level=error msg="ERROR: mdadm collector failed after 0.000215s: error parsing mdstatus: error parsing mdstat: too few matches found in buildline: \tresync=PENDING" source="node_exporter.go:91"
thumbor1001:~$ sudo fgrep 'Kill process' -c /var/log/syslog
116
thumbor1001:~$ date
Wed Sep 14 09:07:24 UTC 2016
(the prometheus-node-exporter issue has been reported upstream)
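For scale, the numbers in the session above can be turned into a kill rate. This is only a rough sketch: it assumes the syslog window starts at the timestamp of its first line (06:25:04) and ends at the `date` output (09:07:24), and the function name `seconds_per_kill` is made up for illustration.

```python
from datetime import datetime

def seconds_per_kill(log_start, checked_at, kills):
    """Average number of seconds between 'Kill process' entries."""
    elapsed = (checked_at - log_start).total_seconds()
    return elapsed / kills

# Timestamps and count copied from the terminal session above (UTC).
log_start = datetime(2016, 9, 14, 6, 25, 4)
checked_at = datetime(2016, 9, 14, 9, 7, 24)
interval = seconds_per_kill(log_start, checked_at, kills=116)

print("one kill every %.0f seconds on average" % interval)
```

Under those assumptions the window is 9740 seconds long, so the limit is being hit about once every minute and a half.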
Since the limit also covers external programs spawned by thumbor, I'll try raising it to 1G and see if that helps.
We should also limit the memory used by thumbor's subprocesses separately, so that a subprocess OOMing doesn't take down the thumbor instance.
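One possible way to cap subprocesses separately is a per-process rlimit applied in the child just before exec. The sketch below is not how thumbor currently spawns its helpers; `run_limited` and `limit_address_space` are hypothetical names, it assumes Python 3.7+ on Linux, and `RLIMIT_AS` caps virtual address space rather than resident memory, so the value needs some headroom.

```python
import resource
import subprocess
import sys

def limit_address_space(max_bytes):
    """Return a callable that caps the calling process's address space."""
    def apply_limit():
        # Runs in the forked child only, so the parent keeps its own limits.
        resource.setrlimit(resource.RLIMIT_AS, (max_bytes, max_bytes))
    return apply_limit

def run_limited(cmd, max_bytes, timeout=30):
    """Run cmd with its own memory cap (hypothetical helper, not thumbor API)."""
    return subprocess.run(
        cmd,
        preexec_fn=limit_address_space(max_bytes),
        capture_output=True,
        timeout=timeout,
    )

if __name__ == "__main__":
    cap = 512 * 1024 * 1024  # illustrative value, not a recommendation
    # A child that tries to allocate 2 GiB fails on its own; the parent survives.
    result = run_limited([sys.executable, "-c", "bytearray(2 * 1024**3)"], cap)
    print("child exit status:", result.returncode)
```

With a cap like this, an oversized allocation fails inside the child (a non-zero exit status the parent can handle) instead of counting against the parent's cgroup limit and triggering the OOM killer there.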