Page MenuHomePhabricator

dumpBackup.php: Add --sort option
Open, Needs TriagePublic

Description

dumpBackup.php do not sort the entries and there is no option to do it as far as I know.

I'm using importDump.php and looking at http://localhost:8080/mediawiki/Special:AllPages to govern the upload process, but its only useful to estimate the quantity that has been uploaded. I cannot verify that all articles that begins with "A", "B", and "C" for example has been uploaded.

I suggest that you add option --sort used to sort by namespace. Example --sort=namespace:0

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptMay 14 2017, 4:17 PM

We dump ordering by page id, and that is so that previous dump files can be re-used when building the dump file for the current run. This saves wear and tear on the databases, as well as being much faster.

As long as you have the list of page titles in the order they are being imported (a grep of '<title>' in the xml file will give you that), you can give the --report option to importDump and keep track of where it is in its processing.