@mwitiderrick
Last active June 14, 2018 16:57
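import re
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer

# Assumes `df` is a pandas DataFrame with a 'Review' column of at least 1000 rows.
corpus = []  # assumed to start empty; drop this line if `corpus` is defined earlier
lemmatizer = WordNetLemmatizer()
stop_words = set(stopwords.words('english'))  # build once, not on every word
for i in range(0, 1000):
    # Replace every non-letter with a space, then lowercase and split on whitespace.
    review = re.sub('[^a-zA-Z]', ' ', df['Review'][i])
    review = review.lower().split()
    # Drop English stopwords and lemmatize the remaining words.
    review = [lemmatizer.lemmatize(word) for word in review if word not in stop_words]
    corpus.append(' '.join(review))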