Skip to content

Instantly share code, notes, and snippets.

@JasonKessler
Last active March 9, 2018 22:58
Show Gist options
  • Save JasonKessler/ed92631c9994755470dd43e737a760c2 to your computer and use it in GitHub Desktop.
Save JasonKessler/ed92631c9994755470dd43e737a760c2 to your computer and use it in GitHub Desktop.
import pandas as pd
import scattertext as st
reviews_df = pd.read_csv('https://github.com/JasonKessler/ICLR18ReviewVis/raw/master/iclr2018_reviews.csv.bz2')
reviews_df['parse'] = reviews_df['review'].apply(st.whitespace_nlp_with_sentences)
corpus = (st.CorpusFromParsedDocuments(reviews_df, category_col = 'decision', parsed_col = 'parse')
.build().remove_categories(['Workshop']))
html = st.produce_scattertext_explorer(corpus,
category='Accept', not_categories=['Reject'],
transform = st.Scalers.dense_rank,
term_scorer = st.RankDifference(),
metadata = corpus.get_df()['metadata'])
open('output/accept_reject_dense.html', 'wb').write(html.encode('utf-8'))
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment