It gets a lot more process restarts than other hosts, what looks like twice the load and CPU usage, spiky IOPS. I wonder if something is up with its hardware.
Description
Details
Project | Branch | Lines +/- | Subject | |
---|---|---|---|---|
operations/puppet | production | +1 -1 | thumbor: use memorysize_mb fact for unit MemoryLimit |
Related Objects
- Mentioned In
- T173580: $wgMaxAnimatedGifArea is not honored by Thumbor
- Mentioned Here
- T173580: $wgMaxAnimatedGifArea is not honored by Thumbor
Event Timeline
For some reason the MemoryLimit=15% change from https://gerrit.wikimedia.org/r/#/c/367373/ doesn't seem to be applied on thumbor1003 and that causes io spikes and additional latency
Change 377264 had a related patch set uploaded (by Filippo Giunchedi; owner: Filippo Giunchedi):
[operations/puppet@production] thumbor: use memorysize_mb fact for unit MemoryLimit
Change 377264 merged by Filippo Giunchedi:
[operations/puppet@production] thumbor: use memorysize_mb fact for unit MemoryLimit
Mentioned in SAL (#wikimedia-operations) [2017-09-11T14:40:06Z] <godog> roll-restart thumbor to apply https://gerrit.wikimedia.org/r/#/c/377264/ and upgrade to 1.4 - T173580 T174997
Indeed, the latency now is the same across all hosts and I've deployed a fix for MemoryLimit to actually DTRT with jessie's systemd, resolving.