Warren Van Wyck wvwhome

'''loads pdf file in sys.argv[1], extracts URLs, tries to load each URL'''
import urllib
import sys
import PyPDF2
# credits to stackoverflow.com/questions/27744210
def extract_urls(filename):
    '''extracts all urls from filename'''
    PDFFile = open(filename, 'rb')