Page MenuHomePhabricator

Stubs files generated for prefetch use don't include metadata for last page in source
Closed, ResolvedPublic

Description

When we generate page content dumps for page ranges, a stubs file is produced to cover the page range. It misses the last page. This results in the page content dumps missing pages, which carries over to the multistream page content dumps.

Reported on the xmldatadumps mailing list, see https://lists.wikimedia.org/pipermail/xmldatadumps-l/2018-February/001399.html

Event Timeline

ArielGlenn created this task.
ArielGlenn moved this task from Backlog to Active on the Dumps-Generation board.

Change 415011 had a related patch set uploaded (by ArielGlenn; owner: ArielGlenn):
[operations/dumps@master] make sure prefetch stubs include metadata for the last page wanted

https://gerrit.wikimedia.org/r/415011

Change 415011 merged by ArielGlenn:
[operations/dumps@master] make sure prefetch stubs include metadata for the last page wanted

https://gerrit.wikimedia.org/r/415011

This is now deployed and will be in effect for the next run starting March 1st. Leaving this ticket open until we see that the stubs and page content files contain the expected pages.

I did not get this deployed until after the start of the March run, though it was ready beforehand. We will have to wait for the March 20th run to verify that the fix work. My apologies!

Did not mean to close this yet!

Verified that the last pages in the stubs show up in pages-articles files. Closing.