@aseigneurin
Last active October 13, 2016 03:22
#!/bin/bash -e
# Download and decompress the 24 hourly Wikipedia pagecounts dumps for one day.

# mkdir -p is a no-op when the directory already exists, so no prior test is needed.
mkdir -p data/wikipedia-pagecounts-hours
cd data/wikipedia-pagecounts-hours

# Date of the dumps to fetch.
yyyy=2014
MM=06
dd=19

# One dump per hour, 00 through 23.
for hh in 00 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23
do
  file="pagecounts-${yyyy}${MM}${dd}-${hh}0000.gz"
  wget "https://dumps.wikimedia.org/other/pagecounts-raw/${yyyy}/${yyyy}-${MM}/${file}"
  gunzip "${file}"
done
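
The script above starts from scratch on every run, so an interrupted run has to be repeated in full. A minimal sketch of a re-runnable variant follows, assuming the same URL layout as the script. The function names `pagecounts_file`, `pagecounts_url` and `fetch_hour` are hypothetical helpers introduced here for illustration and are not part of the original gist.

```shell
#!/bin/bash
# Hypothetical helpers (names are illustrative, not from the original script).

# Print the dump file name for a date and hour.
pagecounts_file() {
  local yyyy=$1 MM=$2 dd=$3 hh=$4
  printf 'pagecounts-%s%s%s-%s0000.gz\n' "$yyyy" "$MM" "$dd" "$hh"
}

# Print the download URL, using the same layout as the script above.
pagecounts_url() {
  local yyyy=$1 MM=$2 dd=$3 hh=$4
  printf 'https://dumps.wikimedia.org/other/pagecounts-raw/%s/%s-%s/%s\n' \
    "$yyyy" "$yyyy" "$MM" "$(pagecounts_file "$yyyy" "$MM" "$dd" "$hh")"
}

# Download and decompress one hour, skipping it when the
# decompressed file is already present in the current directory.
fetch_hour() {
  local gz
  gz=$(pagecounts_file "$@")
  if [ -f "${gz%.gz}" ]; then
    echo "skipping ${gz%.gz}: already present"
    return 0
  fi
  wget -c "$(pagecounts_url "$@")"   # -c resumes a partially downloaded file
  gunzip "$gz"
}

# Possible usage, from inside data/wikipedia-pagecounts-hours:
#   for hh in 00 01 02; do fetch_hour 2014 06 19 "$hh"; done
```

Keeping the file name and URL construction in separate functions makes it possible to check them without touching the network, and the existence test lets the loop be restarted after a failure without downloading completed hours again.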