gearmonkey / icwsm.py
Created July 18, 2011 12:30 — forked from jhofman/icwsm.py
Script to scrape PDFs and paper info for ICWSM 2011
#!/usr/bin/env python
from lxml import etree
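try:
    from urllib.request import urlopen  # Python 3
except ImportError:
    from urllib import urlopen  # Python 2, as originally written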
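if __name__ == '__main__':
    # Schedule page listing the ICWSM 2011 presentations
    url = 'http://www.aaai.org/ocs/index.php/ICWSM/ICWSM11/schedConf/presentations'
    # Parse the fetched page with lxml's lenient HTML parser
    tree = etree.parse(urlopen(url), etree.HTMLParser())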