Venu Kanaparthy vsingh58

  • ESRI
  • California
## Anaconda is recommended, to pick up NumPy, NLTK, etc.
## http://continuum.io/downloads
## this also requires TextBlob/PerceptronTagger
import nltk
## run this within a Python prompt to download the `punkt` data;
## calling nltk.download() with no argument opens the interactive downloader instead
nltk.download("punkt")
vsingh58 / 0.setup.sh
Last active August 29, 2015 14:06 — forked from ceteri/0.setup.sh
# concatenate part files 00001 through 00003 (the glob matches three files) into "minitweets"
cat rawtweets/part-0000[1-3] > minitweets
# set the log4j root level to WARN to reduce noise during the demo
mv conf/log4j.properties.template conf/log4j.properties
vim conf/log4j.properties # change log4j.rootCategory from INFO to WARN
# launch Spark shell REPL
./bin/spark-shell
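
The manual edit of conf/log4j.properties in the steps above could also be scripted. The following is a minimal sketch, not part of the original gist. It assumes the template contains a line beginning with `log4j.rootCategory=INFO` (true of the Spark 1.x templates, but check your own copy), and the function name `set_spark_log_level` is hypothetical.

```shell
# Hypothetical helper: copy the log4j template and lower the root log level.
# Assumption: the template has a line starting with "log4j.rootCategory=INFO";
# if it does not, sed changes nothing and the level stays as it was.
set_spark_log_level() {
  conf_dir="$1"          # directory holding log4j.properties.template
  level="${2:-WARN}"     # target level, WARN by default

  # cp (rather than mv) keeps the template available for later reference
  cp "$conf_dir/log4j.properties.template" "$conf_dir/log4j.properties"

  # -i.bak edits in place on both GNU and BSD sed, leaving a .bak backup
  sed -i.bak "s/^log4j.rootCategory=INFO/log4j.rootCategory=$level/" \
    "$conf_dir/log4j.properties"
}
```

From the Spark install directory, this would be called as `set_spark_log_level conf WARN` before launching `./bin/spark-shell`.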