Skip to content

Instantly share code, notes, and snippets.

View thadguidry's full-sized avatar
🏠
Home, likely researching

Thad Guidry thadguidry

🏠
Home, likely researching
View GitHub Profile
# code to take a PDF and scrape address information. Note that this particular script will
# only work using the specific PDF formatting my PDF had so you can use as a guide but
# it will definitely not work on your PDF!
from pdfminer.pdfinterp import PDFResourceManager, PDFPageInterpreter
from pdfminer.converter import HTMLConverter
from pdfminer.converter import TextConverter
from pdfminer.layout import LAParams
from pdfminer.pdfpage import PDFPage