Page MenuHomePhabricator
Authored By
cscott
Jul 13 2015, 5:04 PM
Size
490 B
Referenced Files
None
Subscribers
None

shuffle.sh

#!/bin/bash
LANG="en de nl fr it ru es sv pl ja ar he hi ko zh"
# link prefix languages
LANG="ar ckb cu cv hy is kaa ka lbe ln mzn pnb uk uz"
HOWMANY=10000
for l in $LANG ; do
echo $l
zcat ${l}wiki-latest-all-titles-in-ns0.gz | sort -R > ${l}wiki-shuffled-titles
head -${HOWMANY} ${l}wiki-shuffled-titles > ${l}wiki-${HOWMANY}
head -2 ${l}wiki-${HOWMANY}
bzip2 ${l}wiki-shuffled-titles &
$(dirname $0)/jsonify.js ${l}wiki-${HOWMANY} > ${l}wiki-${HOWMANY}.json
done

File Metadata

Mime Type
text/x-shellscript
Storage Engine
blob
Storage Format
Raw Data
Storage Handle
183127
Default Alt Text
shuffle.sh (490 B)

Event Timeline