Page MenuHomePhabricator

Parsoid throwing 404 error for valid page/oldid
Closed, ResolvedPublic

Description

https://en.wikipedia.org/wiki/19th_Virginia_Infantry

I'm consistently getting:

The page at https://en.wikipedia.org says:
Error loading data from server: parsoidserver-http-bad-status: 404. Would you like to retry?

Other articles seem unaffected.


Version: unspecified
Severity: normal
URL: http://titanium.wikimedia.org/enwiki/19th_Virginia_Infantry?oldid=561398978

Details

Reference
bz50121

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 22 2014, 1:59 AM
bzimport added a project: Parsoid.
bzimport set Reference to bz50121.

I fixed the particular issue by asking for a purged version from the cache directly, but this might be a wider issue? oldid=561398978

So far this seems to be an isolated case, which would be compatible with us not properly distinguishing between API request errors and non-existing pages. A single API hickup would then lead to a cached 404. Needs investigation.

[Parsoid component reorg by merging JS/General and General. See bug 50685 for more information. Filter bugmail on this comment. parsoidreorg20130704]

(In reply to comment #5)

And a third case on
http://en.wikipedia.org/w/index.
php?title=National_Institute_of_Open_Schooling&oldid=563517112
(which I discovered in my volunteer mode)

I am not able to reproduce this after a cache purge. This means that these errors are definitely not systematic to some title or revision, which fits with the API error suspicion.

Change 173102 had a related patch set uploaded by Arlolra:
(Bug 50121) Only return 404 page does not exist

https://gerrit.wikimedia.org/r/173102

Change 173102 merged by jenkins-bot:
(Bug 50121) Only return 404 when page does not exist

https://gerrit.wikimedia.org/r/173102