@earino
Last active April 22, 2020 04:23
1. Explain in your own words what the unnest_tokens function does.
2. Explain in your own words what the gutenbergr package does.
3. Explain in your own words how sentiment lexicons work.
4. How does inner_join provide sentiment analysis functionality?
5. Explain in your own words what tf-idf does.
6. Explain why you may want to tokenize by bigram.
7. Please install the following packages, if you have not already:
   1. tidyverse
   2. tidytext
   3. gutenbergr
Pick two or more authors that you are familiar with, download their texts using the gutenbergr package, and do a
basic analysis of word frequencies and tf-idf.
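
As a starting point, here is a minimal sketch of the word-frequency and tf-idf steps. It is not a solution to the exercise: the `books` tibble and its `author_a`/`author_b` labels are made-up stand-ins so the code runs without a download. In your own analysis the text would come from gutenbergr (for example via `gutenberg_download()`), and you would choose the authors and works yourself.

```r
# install.packages(c("tidyverse", "tidytext", "gutenbergr"))  # run once if needed
library(dplyr)
library(tidytext)

# Toy stand-in for downloaded books (hypothetical data, not real Gutenberg text).
books <- tibble(
  author = c("author_a", "author_a", "author_b", "author_b"),
  text = c(
    "the whale swam in the sea",
    "the captain chased the whale",
    "the lady danced at the ball",
    "the lady wrote a letter"
  )
)

# Split the text into one row per word, then count words per author.
word_counts <- books %>%
  unnest_tokens(word, text) %>%
  count(author, word, sort = TRUE)

# Add tf, idf, and tf_idf columns, treating each author as one document.
author_tf_idf <- word_counts %>%
  bind_tf_idf(word, author, n) %>%
  arrange(desc(tf_idf))

print(author_tf_idf)
```

In this toy data, "the" is used by both authors, so its idf (and therefore its tf-idf) is zero, while words used by only one author score above zero. The same pattern is what makes tf-idf useful for comparing real authors.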
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment