Skip to content

Instantly share code, notes, and snippets.

View ankitkv's full-sized avatar
🔬
Doing science

Ankit Vani ankitkv

🔬
Doing science
View GitHub Profile
@ankitkv
ankitkv / crawler.py
Created May 22, 2014 14:54
A simple library to write crawlers. You need to inherit a crawler type and override the 'parse' method. Call self.add_url for URL's to add to the crawl queue.
"""Base framework for simple crawlers"""
__author__ = "Ankit Vani"
__version__ = "0.1"
import bs4
import csv
import httplib
import json
import sys
import urllib2