Skip to content

Instantly share code, notes, and snippets.

@gadgetking-y
Created November 27, 2021 05:43
Show Gist options
  • Save gadgetking-y/09ac8c28a2b5b8f6ded00d0d0fa9f63a to your computer and use it in GitHub Desktop.
Save gadgetking-y/09ac8c28a2b5b8f6ded00d0d0fa9f63a to your computer and use it in GitHub Desktop.
import glob
import sys
from pdfminer.high_level import extract_text
list_of_files = glob.glob(/path/to/pdf/*.pdf)
for pdf_filename in list_of_files:
print(f'=========== {pdf_filename} =============')
text = extract_text(pdf_filename, page_numbers=0) # 0 = Page.1 になる
print(text)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment