@saveyak
Created September 16, 2020 00:28
Using Python's Tika package to read hundreds of pages of PDFs and create a dataframe
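
The gist's own code is not shown here, so the following is only a minimal sketch of the idea in the description, not the author's method. It assumes the `tika` and `pandas` packages are installed and that Java is available, since `tika` starts a local Tika server. The helper names (`parse_pdf`, `text_to_rows`, `pdfs_to_dataframe`) and the column names are hypothetical, and storing one row per non-empty line of text is an assumption about the desired layout.

```python
from pathlib import Path


def parse_pdf(path):
    """Extract plain text from one PDF using Tika (hypothetical helper)."""
    # Imported lazily so the pure-Python helper below works without Tika.
    from tika import parser

    parsed = parser.from_file(str(path))
    # "content" can be None for scanned or empty PDFs.
    return parsed.get("content") or ""


def text_to_rows(source, text):
    """Turn extracted text into one record per non-empty line."""
    rows = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        stripped = line.strip()
        if stripped:
            rows.append({"source": source, "line": line_number, "text": stripped})
    return rows


def pdfs_to_dataframe(folder):
    """Parse every *.pdf in a folder and collect the lines in a DataFrame."""
    import pandas as pd

    rows = []
    for pdf_path in sorted(Path(folder).glob("*.pdf")):
        rows.extend(text_to_rows(pdf_path.name, parse_pdf(pdf_path)))
    return pd.DataFrame(rows, columns=["source", "line", "text"])
```

Calling `pdfs_to_dataframe("pdfs/")` on a folder of PDFs (the folder name is a placeholder) would return a frame with `source`, `line`, and `text` columns, which can then be filtered or split further with ordinary pandas string methods.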