Skip to content

Instantly share code, notes, and snippets.

@jonasagx
Forked from halfak/gist:1a0a9e0c754e12f2686a2eaa8046667d
Last active January 20, 2017 17:55
Show Gist options
  • Star 1 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save jonasagx/a4c8bb1a4d1aca29ff8c7c2e5f2752dd to your computer and use it in GitHub Desktop.
Save jonasagx/a4c8bb1a4d1aca29ff8c7c2e5f2752dd to your computer and use it in GitHub Desktop.
mkdir venv
cd venv
virtualenv 3.4 -p $(which python3) --system-site-packages
cd ..
source venv/3.4/bin/activate
mkdir projects
cd projects
git clone https://jonasagx@github.com/wiki-ai/revscoring
cd revscoring
python -m nltk.downloader stopwords
pip install -r requirements.txt
pip install revscoring==1.3.4
cd ..
git clone https://jonasagx@github.com/wiki-ai/editquality
cd editquality
pip install -r requirements.txt
make datasets/enwiki.labeled_revisions.20k_2015.json
cat datasets/enwiki.labeled_revisions.20k_2015.json | shuf -n 1000 | revscoring extract editquality.feature_lists.enwiki.damaging --host https://en.wikipedia.org --profile enwiki.damaging.profile.md > /dev/null
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment