Skip to content

Instantly share code, notes, and snippets.

View veekaybee's full-sized avatar
💫
in the latent space

Vicki Boykis veekaybee

💫
in the latent space
View GitHub Profile

Keybase proof

I hereby claim:

  • I am veekaybee on github.
  • I am veekaybee (https://keybase.io/veekaybee) on keybase.
  • I have a public key ASC1BmRUMCaXHMnJ2DzEnxIyypbZqJmYGJIbCxhhrrSZKgo

To claim this, I am signing this object:

@veekaybee
veekaybee / nltk-corpora.md
Created March 17, 2017 02:09
Deep-diving into NLTK corpora

# What is NLTK? 

A natural-language processing library written in Python, used for tons of applications, including analyzing [movie and restaurant reviews](http://crowdsourcing-class.org/assignments/downloads/pak-paroubek.pdf). 
More on that [here](https://github.com/nltk/nltk/wiki/Sentiment-Analysis).

[Examples](http://www.laurentluce.com/posts/twitter-sentiment-analysis-using-python-and-nltk/) of how to do sentiment analysis in Python. 
Note that tweets here are hand-labelled with regards to sentiment.
from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
le.fit(['a', 'b', 'c', 'c’])
dict(zip(le.classes_, range(len(le.classes_))))
>>>{'a': 0, 'b': 1, 'c': 2}
@veekaybee
veekaybee / mkdwn.md
Last active August 29, 2015 14:24
If you write a lot of stuff in Word, Markdown might be a better option for you

Markdown is a text editing language, like HTML. If you use Word or HTML to write specs and documentation, Markdown may be a better, more lightweight option for you. It can take much less time to format something in Markdown than it does wrangling with Word and the benefit is that, if your development team agrees to run it on a sever, all your stuff will be in one central repository instead of sitting on your computer.

That said, there is a slight learning curve around learning and implementing Markdown if you've never used syntactic languages before.

Here are the recommendations I've come across:

  • Markdown does not auto-generate tables of contents. You have to do it yourself.