Page MenuHomePhabricator

https://stream.wikimedia.org/v2/stream/recentchange 502 Bad Gateway
Closed, ResolvedPublic

Description

$ curl https://stream.wikimedia.org/v2/stream/recentchange
<html>
<head><title>502 Bad Gateway</title></head>
<body bgcolor="white">
<center><h1>502 Bad Gateway</h1></center>
<hr><center>nginx/1.13.6</center>
</body>
</html>

Event Timeline

Xqt raised the priority of this task from High to Needs Triage.Mar 30 2018, 10:08 AM
Xqt added projects: Event-Platform, EventStreams.

Looks like an upstream issue

Xqt claimed this task.

Working again. Thanks.

Again the same problem as described above.

Xqt removed Xqt as the assignee of this task.Mar 30 2018, 1:36 PM
zhuyifei1999 renamed this task from EventStreams fails due to Bad Gateway for url: https://stream.wikimedia.org/v2/stream/recentchange to https://stream.wikimedia.org/v2/stream/recentchange 502 Bad Gateway.Mar 31 2018, 4:56 PM
zhuyifei1999 triaged this task as High priority.
zhuyifei1999 updated the task description. (Show Details)
zhuyifei1999 subscribed.

Raising priority to High. A production service is broken here.

The service is back up and working. From the logs, it seems that some clients fell behind, causing the service the accumulate messages and exceed the set memory limit. As a first-course remedy, we can increase the limits a bit as there is headroom for that on the hosts.

Change 423341 had a related patch set uploaded (by Mobrovac; owner: Mobrovac):
[mediawiki/services/eventstreams/deploy@master] Config: Increase memory and heartbeat limits

https://gerrit.wikimedia.org/r/423341

Change 423341 abandoned by Mobrovac:
Config: Increase memory and heartbeat limits

Reason:
Not viable because of T199813

https://gerrit.wikimedia.org/r/423341

Pchelolo claimed this task.
Pchelolo subscribed.

I believe after T199813 was fixed this should not happen again.