Skip to content

Instantly share code, notes, and snippets.

@wandabwa2004
Created January 17, 2020 05:24
Show Gist options
  • Save wandabwa2004/7a6fdf84d0dae5e16059df678f563220 to your computer and use it in GitHub Desktop.
Save wandabwa2004/7a6fdf84d0dae5e16059df678f563220 to your computer and use it in GitHub Desktop.
#Bi-gram dictionary generation process
dictionary_bi_q1 = gensim.corpora.Dictionary(bigram_mod_q1[data_words_q1])
dictionary_bi_q2 = gensim.corpora.Dictionary(bigram_mod_q2[data_words_q2])
dictionary_bi_q3 = gensim.corpora.Dictionary(bigram_mod_q3[data_words_q3])
dictionary_bi_q4 = gensim.corpora.Dictionary(bigram_mod_q4[data_words_q4])
#Bigram corpus generation process from the dictionary of the subsets.
bi_corpus_q1 = [dictionary_bi_q1.doc2bow(doc) for doc in bigram_mod_q1[data_words_q1]]
bi_corpus_q2 = [dictionary_bi_q2.doc2bow(doc) for doc in bigram_mod_q2[data_words_q2]]
bi_corpus_q3 = [dictionary_bi_q3.doc2bow(doc) for doc in bigram_mod_q3[data_words_q3]]
bi_corpus_q4 = [dictionary_bi_q4.doc2bow(doc) for doc in bigram_mod_q4[data_words_q4]]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment