
Memcached error for key "WANCache:v:enwiki:image_redirect:254363f3d14af58bbe12c644ee69ccf7" on server "/var/run/nutcracker/nutcracker.sock:0": A TIMEOUT OCCURRED
Closed, Resolved, Public

Description

Both the memcached dashboards have a new trending error:
https://logstash.wikimedia.org/#/dashboard/elasticsearch/memcached-errors
https://logstash.wikimedia.org/#/dashboard/elasticsearch/memcached-serious

Memcached error for key "WANCache:v:enwiki:image_redirect:254363f3d14af58bbe12c644ee69ccf7" on server "/var/run/nutcracker/nutcracker.sock:0": A TIMEOUT OCCURRED
Memcached error for key "WANCache:v:enwiki:image_redirect:254363f3d14af58bbe12c644ee69ccf7" on server "/var/run/nutcracker/nutcracker.sock:0": SERVER ERROR

These two errors, for that one key alone, together make up 1890 errors per 5 minutes (out of ~1930 error events from memcached in total).
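For anyone wanting to reproduce the per-key count from raw log lines, here is a minimal sketch. It assumes the messages follow exactly the format quoted above; the regex, the function name `tally_by_key`, and the `sample` list are illustrative and not part of any existing tooling.

```python
import re
from collections import Counter

# Matches the message format quoted in the description (assumed stable).
LINE_RE = re.compile(
    r'Memcached error for key "(?P<key>[^"]+)" '
    r'on server "(?P<server>[^"]+)": (?P<error>.+)'
)

def tally_by_key(lines):
    """Count error events per (key, error type); non-matching lines are skipped."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.search(line)
        if m:
            counts[(m.group("key"), m.group("error").strip())] += 1
    return counts

# Hypothetical input: the two messages quoted in the description.
KEY = "WANCache:v:enwiki:image_redirect:254363f3d14af58bbe12c644ee69ccf7"
SERVER = "/var/run/nutcracker/nutcracker.sock:0"
sample = [
    f'Memcached error for key "{KEY}" on server "{SERVER}": A TIMEOUT OCCURRED',
    f'Memcached error for key "{KEY}" on server "{SERVER}": SERVER ERROR',
]

for (key, error), n in tally_by_key(sample).most_common():
    print(n, error, key)

# Share of all memcached error events attributed to this key (figures from above).
share = 1890 / 1930
print(f"{share:.1%}")
```

With the figures from the dashboards, that one key accounts for roughly 98% of the memcached error events in the window.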

It seems equally distributed among the different mw hosts.

It started right after midnight, 2015-06-18 00:00 UTC.

Event Timeline

Krinkle raised the priority of this task from to Needs Triage.
Krinkle updated the task description.
Krinkle subscribed.

Given the timestamp, I suspect it's related to today's Main Page, and indeed that image is on the Main Page. Going by the first two characters of the hash, "254363f3" -> "2/25/", I found:

<img src=".. upload.wikimedia.org/wikipedia/commons/thumb/2/25/Grand_Duchess_Anastasia_Nikolaevna_Crisco_edit_letters_removed.jpg/300px-Grand_Duchess_Anastasia_Nikolaevna_Crisco_edit_letters_removed.jpg" width="300" height="432" ..

https://en.wikipedia.org/wiki/File:Grand_Duchess_Anastasia_Nikolaevna_Crisco_edit_letters_removed.jpg
https://commons.wikimedia.org/wiki/File:Grand_Duchess_Anastasia_Nikolaevna_Crisco_edit_letters_removed.jpg
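The hash-to-path lookup used above can be sketched as follows. This assumes the upload path uses the first character and then the first two characters of an MD5 hex digest, and that the digest is taken over the underscore-form file name; both function names are hypothetical, and the second assumption is not confirmed anywhere in this task.

```python
import hashlib

def shard_path_from_hash(hex_hash: str) -> str:
    """Turn a hex digest into the two-level directory prefix, e.g. 'a/ab/'."""
    return f"{hex_hash[0]}/{hex_hash[:2]}/"

def shard_path_from_name(file_name: str) -> str:
    """Assumption: the digest is MD5 of the file name with underscores."""
    digest = hashlib.md5(file_name.encode("utf-8")).hexdigest()
    return shard_path_from_hash(digest)

# Hash taken from the memcached key in the task title.
print(shard_path_from_hash("254363f3d14af58bbe12c644ee69ccf7"))  # prints 2/25/
```

Calling `shard_path_from_name` on the file name from the URLs above would be one way to check whether the key hash and the upload path really come from the same digest.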

Krinkle set Security to None.

Note that since nutcracker is just a proxy, it could still indicate a problem on an mc host (the one that the key maps to). Not sure what logs we have there.

demon subscribed.

Marking high because the mentioned task is also high.

fgiunchedi claimed this task.
fgiunchedi subscribed.

Looks like this was fixed, or supplanted by different errors, going by https://logstash.wikimedia.org/#/dashboard/elasticsearch/memcached. Tentatively resolving.