we should have a (bash?) script for stitching multiple shards of a json dump into a single dump (and probably compress it at the same time).
Version: unspecified
Severity: normal
Whiteboard: u=dev c=infrastructure p=0
we should have a (bash?) script for stitching multiple shards of a json dump into a single dump (and probably compress it at the same time).
Version: unspecified
Severity: normal
Whiteboard: u=dev c=infrastructure p=0
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T88728 Improve Wikimedia dumping infrastructure | |||
Open | None | T88991 improve Wikidata dumps [tracking] | |||
Resolved | JanZerebecki | T70366 add script for stitching json dump shards |
This is already done in operations/puppet.git/modules/snapshot/files/dumpwikidatajson.sh