Created
January 17, 2020 05:24
-
-
Save wandabwa2004/7a6fdf84d0dae5e16059df678f563220 to your computer and use it in GitHub Desktop.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#Bi-gram dictionary generation process | |
dictionary_bi_q1 = gensim.corpora.Dictionary(bigram_mod_q1[data_words_q1]) | |
dictionary_bi_q2 = gensim.corpora.Dictionary(bigram_mod_q2[data_words_q2]) | |
dictionary_bi_q3 = gensim.corpora.Dictionary(bigram_mod_q3[data_words_q3]) | |
dictionary_bi_q4 = gensim.corpora.Dictionary(bigram_mod_q4[data_words_q4]) | |
#Bigram corpus generation process from the dictionary of the subsets. | |
bi_corpus_q1 = [dictionary_bi_q1.doc2bow(doc) for doc in bigram_mod_q1[data_words_q1]] | |
bi_corpus_q2 = [dictionary_bi_q2.doc2bow(doc) for doc in bigram_mod_q2[data_words_q2]] | |
bi_corpus_q3 = [dictionary_bi_q3.doc2bow(doc) for doc in bigram_mod_q3[data_words_q3]] | |
bi_corpus_q4 = [dictionary_bi_q4.doc2bow(doc) for doc in bigram_mod_q4[data_words_q4]] |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment