Page MenuHomePhabricator

Upload errors due to swift failures, 503s
Closed, ResolvedPublicBUG REPORT

Description

Steps to replicate the issue (include links if applicable):

  • Users are reporting errors during upload (T328872#9956803 and below)
  • "An unknown error occurred in storage backend "local-swift-eqiad"
  • "Could not store upload in the stash (UploadStashFileException)"
  • Could not read the status of file "mwstore://local-multiwrite/local-public/archive/9/96/20240705164303!At_Swindon_Steam_Railway_Museum_2024_176.jpg".

The MediaWiki error backend-fail-stat occured: Could not read the status of file "mwstore://local-multiwrite/local-public/archive/3/3a/20240705173523!At_Swindon_Steam_Railway_Museum_2024_206.jpg".

What happens?:

Screenshot 2024-07-05 at 20.29.17.png (1×2 px, 860 KB)

Screenshot 2024-07-05 at 20.30.14.png (660×1 px, 141 KB)

What should have happened instead?:

  • No increase in errors ;)

Software version (on Special:Version page; skip for WMF-hosted wikis like Wikipedia):

Other information (browser name/version, screenshots, etc.):

Event Timeline

Thanks for reporting the issue, I'm investigating it and I've shared it with fellow SRE's for advice.

It seems a bad frontend server was the source of these errors, and a rolling restart appears to have addressed this but we'll follow up during the week to see if there are any obvious causes.

Uploads have been going much better today, thank you. :)

MatthewVernon lowered the priority of this task from Unbreak Now! to Medium.Jul 8 2024, 9:14 AM
MatthewVernon subscribed.

[this is likely related to T360913]

TheDJ claimed this task.

As the cause of this seems covered by T360913 and the incident as reported was resolved, i'm closing this.