we should have a (bash?) script for stitching multiple shards of a json dump into a single dump (and probably compress it at the same time).
Version: unspecified
Severity: normal
Whiteboard: u=dev c=infrastructure p=0
we should have a (bash?) script for stitching multiple shards of a json dump into a single dump (and probably compress it at the same time).
Version: unspecified
Severity: normal
Whiteboard: u=dev c=infrastructure p=0
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| Open | None | T88728 Improve Wikimedia dumping infrastructure | |||
| Open | None | T88991 improve Wikidata dumps [tracking] | |||
| Resolved | JanZerebecki | T70366 add script for stitching json dump shards |
This is already done in operations/puppet.git/modules/snapshot/files/dumpwikidatajson.sh