Page MenuHomePhabricator

Database dumps, smaller - pages-articles.xml should be avalaible in .7z as well
Closed, DeclinedPublic

Description

Author: adam1213

Description:
pages-articles.xml.bz2 "Articles, templates, image descriptions, and primary meta-pages" ("the archive most mirror sites will probably want") is currently only avaliable in .bz2
It should be made available in .7z as well

pages-meta-history.xml.7z is 3.2 GB the .bz2 is 5 gigs
It would appear that .7z is 64% of the size of .bz2
So pages-articles.xml.bz2 could be aprox 1.7 gigs, instead of 2.7 gigs. This may even reduce some server load/bandwith.


Version: unspecified
Severity: enhancement
URL: http://download.wikimedia.org/enwiki/20070716/

Details

Reference
bz10693

Event Timeline

bzimport raised the priority of this task from to Lowest.Nov 21 2014, 9:52 PM
bzimport set Reference to bz10693.

The extreme size savings for the history dumps are due to better compression of the multiple-revision runs. Current-version dumps compress to about the same size with .bz2 and .7z, but p7zip takes much much longer to run.