Skip to content

Instantly share code, notes, and snippets.

View gangeshwark's full-sized avatar
🎯
Focusing

Gangeshwar Krishnamurthy gangeshwark

🎯
Focusing
View GitHub Profile
@gangeshwark
gangeshwark / preprocess-twitter.py
Created November 2, 2017 04:44 — forked from tokestermw/preprocess-twitter.py
Python version of Ruby script to preprocess tweets for use in GloVe featurization http://nlp.stanford.edu/projects/glove/
"""
preprocess-twitter.py
python preprocess-twitter.py "Some random text with #hashtags, @mentions and http://t.co/kdjfkdjf (links). :)"
Script for preprocessing tweets by Romain Paulus
with small modifications by Jeffrey Pennington
with translation to Python by Motoki Wu
Translation of Ruby script to create features for GloVe vectors for Twitter data.