@chrismattmann
Created April 18, 2019 16:59
Parse the Mueller Report
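from tika import parser

# Parse the PDF with Apache Tika through the tika-python bindings (needs Java available locally)
parsed = parser.from_file('Mueller-Report-v1-v2.pdf')
# Metadata: dict of document properties reported by Tika
print(parsed["metadata"])
# Content: extracted plain text (can be None if no text was extracted)
print(parsed["content"])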
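
Printing the full content of a long report floods the terminal, so a small summary step can be handy. The following is a minimal sketch, not part of the original gist: `summarize_parsed` and the `sample` dict are hypothetical names introduced here, and the code assumes only that the parsed result is a dict with `"metadata"` and `"content"` keys, as used above.

```python
import re
from collections import Counter


def summarize_parsed(parsed, top_n=5):
    """Summarize a Tika-style result dict with "metadata" and "content" keys.

    Hypothetical helper for illustration; it is not part of tika-python.
    """
    # Either value may be None, so fall back to empty defaults.
    text = parsed.get("content") or ""
    metadata = parsed.get("metadata") or {}
    # Crude tokenizer: lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return {
        "metadata_keys": sorted(metadata),
        "char_count": len(text),
        "word_count": len(words),
        "top_words": Counter(words).most_common(top_n),
    }


# Hand-made stand-in for the dict returned by parser.from_file(...),
# so the sketch runs without Tika or the PDF being present.
sample = {
    "metadata": {"Content-Type": "application/pdf"},
    "content": "\n\nReport on the investigation. The report has two volumes.\n",
}
summary = summarize_parsed(sample)
print(summary["metadata_keys"])
print(summary["word_count"], summary["top_words"])
```

With the gist's own code, the `parsed` value could be passed in directly in place of `sample`, for example `summarize_parsed(parsed)`.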