Created
February 16, 2018 16:16
-
-
Save earino/ec54b01dfe59dbf20f550909625beeb3 to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
1. Explain in your words what the unnest_token function does | |
2. Explain your words what the gutenbergr package does | |
3. Explain in your words how sentiment lexicon work | |
4. How does inner_join provide sentiment analysis functionality | |
5. Explain in your words what tf-idf does | |
6. Explain why you may want to do tokenization by bigram | |
7. Please install the following packages, if you have not already: | |
1. tidyverse | |
2. tidytext | |
3. gutenbergr | |
Pick two or more authors that you are familiar with, download their texts using the gutenbergr package, and do a basic analysis of word frequencies and TF-IDF |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment