-
-
Save CyrilRJK/68b7b81f505c06fc8b3ecc8e346d48d2 to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
tokenizer = cv.build_tokenizer() | |
tokenized_docs = [tokenizer(doc) for doc in df['processed docs']] |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment