Skip to content

Instantly share code, notes, and snippets.

@JasonKessler
Created February 7, 2018 02:15
Show Gist options
  • Save JasonKessler/5e147f3b604303ec6867a84b019b3957 to your computer and use it in GitHub Desktop.
Save JasonKessler/5e147f3b604303ec6867a84b019b3957 to your computer and use it in GitHub Desktop.
Scraping Openreview.net for ICLR Reviews (1)
url = 'https://openreview.net/notes?invitation=ICLR.cc%2F2018%2FConference%2F-%2FBlind_Submission&offset=0&limit=1000'
df = pd.DataFrame(requests.get(url).json()['notes']) # Each row in this data frame is a paper.
forum_content = []
for i, forum_id in list(enumerate(df.forum)): # Each forum_id is a review, comment, or acceptance decision about a paper.
forum_content.append(requests.get('https://openreview.net/notes?forum={}&trash=true'.format(forum_id)).json())
time.sleep(.3)
df['forumContent'] = pd.Series(forum_content)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment