Page MenuHomePhabricator

Log messages at ERROR level on http channel: Special:ExtensionDistributor unable to connect to https://graphite.wikimedia.org
Open, Needs TriagePublic

Description

After enabling ERROR logging for all channels on group0 wikis (T228838), errors like this appeared:

https://logstash.wikimedia.org/goto/a458bea8bae20b6c5597fc538ad54f8c

HTTP/1.1 - NULL cURL error 7: Failed to connect to graphite.wikimedia.org port 443: Connection timed out (see https://curl.haxx.se/libcurl/c/libcurl-errors.html) for https://graphite.wikimedia.org/render/?...

(not copying the whole message because it's weirdly detailed and I don't want to leak something)

The errors are logged when users access https://www.mediawiki.org/wiki/Special:ExtensionDistributor (or Special:SkinDistributor).

The volume is low, but I'm filing a task anyway for evaluation, since we were not recording these logs before.

Event Timeline

I'm not sure what is this extension doing, but http requests to our edge (so, using the public graphite url) will for sure fail, and even if we use the internal url, we'd need to open a firewall rule for it.

Change #1075016 had a related patch set uploaded (by Giuseppe Lavagetto; author: Giuseppe Lavagetto):

[operations/puppet@production] service_proxy: Add a listener for the http interface of graphite

https://gerrit.wikimedia.org/r/1075016

Change #1075017 had a related patch set uploaded (by Giuseppe Lavagetto; author: Giuseppe Lavagetto):

[operations/mediawiki-config@master] ExtensionDistributor: reach graphite via the service mesh

https://gerrit.wikimedia.org/r/1075017

This probably was to embed graphite powered graphs in MediaWiki regarding extension download numbers. And the URL hasn't worked in quite a while anyway.