Skip to content

Instantly share code, notes, and snippets.

@andyreagan
Last active August 29, 2015 14:16
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save andyreagan/dddabe583e223b929f9c to your computer and use it in GitHub Desktop.
Save andyreagan/dddabe583e223b929f9c to your computer and use it in GitHub Desktop.
get the google books corpus
for corpus in 0 1 2 3 4 5 6 7 8 9 a b c d e f g h i j k l m n o other p pos punctuation q r s t u v w x y z
do
wget http://storage.googleapis.com/books/ngrams/books/googlebooks-eng-all-1gram-20120701-${corpus}.gz
done
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment