Skip to content

Instantly share code, notes, and snippets.

@earino
Created February 16, 2018 16:16
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 3 You must be signed in to fork a gist
  • Save earino/ec54b01dfe59dbf20f550909625beeb3 to your computer and use it in GitHub Desktop.
Save earino/ec54b01dfe59dbf20f550909625beeb3 to your computer and use it in GitHub Desktop.
1. Explain in your words what the unnest_token function does
2. Explain your words what the gutenbergr package does
3. Explain in your words how sentiment lexicon work
4. How does inner_join provide sentiment analysis functionality
5. Explain in your words what tf-idf does
6. Explain why you may want to do tokenization by bigram
7. Please install the following packages, if you have not already:
1. tidyverse
2. tidytext
3. gutenbergr
Pick two or more authors that you are familiar with, download their texts using the gutenbergr package, and do a basic analysis of word frequencies and TF-IDF
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment