Page MenuHomePhabricator

Higher failed fetches error rate on some caching servers
Closed, ResolvedPublic

Description

Some frontend caching servers appear to have a way higher failed fetches error rate on upload than others

eqsin.png (1×3 px, 599 KB)

esams.png (1×3 px, 429 KB)

Event Timeline

Change 537625 had a related patch set uploaded (by Vgutierrez; owner: Vgutierrez):
[operations/puppet@production] ATS: Ensure that the origin timeout is also applied to parent servers

https://gerrit.wikimedia.org/r/537625

Change 537625 merged by Vgutierrez:
[operations/puppet@production] ATS: Ensure that the origin timeout is also applied to parent servers

https://gerrit.wikimedia.org/r/537625

Change 537630 had a related patch set uploaded (by Vgutierrez; owner: Vgutierrez):
[operations/puppet@production] ATS: Avoid Proxy-Connection from spreading to varnish-fe and ats-be

https://gerrit.wikimedia.org/r/537630

Change 537630 merged by Vgutierrez:
[operations/puppet@production] ATS: Avoid Proxy-Connection from spreading to varnish-fe and ats-be

https://gerrit.wikimedia.org/r/537630

Mentioned in SAL (#wikimedia-operations) [2019-09-18T12:18:34Z] <vgutierrez> restarting ats-tls to avoid spreading Proxy-Connection header - T233205

Vgutierrez claimed this task.
Vgutierrez triaged this task as Medium priority.

Solved by preventing Proxy-Connection from spreading across varnish-fe and ats-be, thanks for reporting the issue @jijiki