Investigate sharp increase in lost Arc Lamp samples (arclamp_client_error.exception)
Closed, Resolved (Public)

Description

https://grafana.wikimedia.org/d/yVf-D1RWk/arc-lamp?orgId=1

Screenshot 2023-10-02 at 16.22.55.png (1×1 px, 228 KB)

@fgiunchedi mentioned that this ramp-up likely correlates with the MW-on-k8s rollout.

Event Timeline

Change 962725 had a related patch set uploaded (by Krinkle; author: Krinkle):

[operations/mediawiki-config@master] Profiler: Enable logging of caught Redis exceptions to Logstash

https://gerrit.wikimedia.org/r/962725

Change 962725 merged by jenkins-bot:

[operations/mediawiki-config@master] Profiler: Enable logging of caught Redis exceptions to Logstash

https://gerrit.wikimedia.org/r/962725

Allowing Redis egress to the arclamp hosts did the trick; there are no more exceptions from arclamp.

fgiunchedi claimed this task.

Optimistically resolving

Quoting here for reference, as it tagged a different webperf issue by accident:

Change 963024 merged by Filippo Giunchedi:

[operations/deployment-charts@master] services: fix xenon/arclamp redis egress rules

https://gerrit.wikimedia.org/r/963024
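
For reference, a blocked egress path like the one fixed above usually shows up as a failed or timed-out TCP connection from the client side. The following is a minimal sketch of a reachability probe, not the check that was actually used here. The function name `can_reach` is hypothetical, and the host and port in the usage comment are placeholders (6379 is only Redis's default port, assumed rather than taken from this task).

```python
import socket


def can_reach(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Any OSError (connection refused, timeout, DNS failure) counts as
    unreachable, which is how a missing egress rule would typically appear
    to the client.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Example usage (placeholder host, assumed default Redis port):
#   can_reach("arclamp.example.internal", 6379)
```

Run from inside an affected pod, a probe like this would be expected to return False before an egress rule is added and True afterwards. It only tests TCP reachability, not whether Redis itself accepts commands.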