@robsannaa
Created February 20, 2019 08:55
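import re

from nltk.corpus import stopwords
from nltk.stem.porter import PorterStemmer

# Assumes `yelp` is a pandas DataFrame with a 'text' column, defined elsewhere,
# and that the NLTK stopword list is available (nltk.download('stopwords')).
ps = PorterStemmer()
stop_words = set(stopwords.words('english'))  # build once, not once per word

corpus = []
for text in yelp['text'].values:
    # Replace every non-letter with a space, lowercase, and split into tokens.
    review = re.sub('[^a-zA-Z]', ' ', text).lower().split()
    # Drop English stopwords, then stem the remaining tokens.
    review = [ps.stem(word) for word in review if word not in stop_words]
    corpus.append(' '.join(review))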