Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Save cheeseonamonkey/f00a5862ced0ca38c173078db33ffbb7 to your computer and use it in GitHub Desktop.
Save cheeseonamonkey/f00a5862ced0ca38c173078db33ffbb7 to your computer and use it in GitHub Desktop.
simple bash script for downloading the Google word2vec model (https://code.google.com/archive/p/word2vec/) from Google-Drive
#!/bin/bash
# usage:
# first make the file executable
# ./word2vec-download300model.sh output-file
OUTPUT=$( wget --save-cookies cookies.txt --keep-session-cookies --no-check-certificate 'https://docs.google.com/uc?export=download&id=0B7XkCwpI5KDYNlNUTTlSS21pQmM' -O- | sed -rn 's/.*confirm=([0-9A-Za-z_]+).*/Code: \1\n/p' )
CODE=${OUTPUT##*Code: }
echo $CODE
F='GoogleNews-vectors-negative300.bin.gz'
if [ -z "$1" ]
then
OUT_F=$F
else
OUT_F=$1/$F
fi
echo $OUT_F
wget --load-cookies cookies.txt 'https://docs.google.com/uc?export=download&confirm='$CODE'&id=0B7XkCwpI5KDYNlNUTTlSS21pQmM' -O $OUT_F
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment