Skip to content

Instantly share code, notes, and snippets.

View oiuww09fn's full-sized avatar

bob oiuww09fn

View GitHub Profile
# pdf pdftotext
# doc antiword
# docx docx
# excel xlrd
from subprocess import Popen, PIPE
from docx import opendocx, getdocumenttext
#http://stackoverflow.com/questions/5725278/python-help-using-pdfminer-as-a-library
from pdfminer.pdfinterp import PDFResourceManager, PDFPageInterpreter