Skip to content

Instantly share code, notes, and snippets.

@evz
Created July 5, 2016 15:01
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save evz/c923fb5ac584f80345dfaf2ca69fb2ca to your computer and use it in GitHub Desktop.
Save evz/c923fb5ac584f80345dfaf2ca69fb2ca to your computer and use it in GitHub Desktop.
Download and extract data about CPS lead tests
import subprocess
import sys
from io import StringIO
import csv
if __name__ == "__main__":
proc = subprocess.Popen(['java',
'-jar',
'../tabula-java/target/tabula-0.9.0-jar-with-dependencies.jar',
'--pages',
'all',
'-r',
'Individualschool_Beasley.pdf'], stdout=subprocess.PIPE)
output, error = proc.communicate()
output_string = output.decode('utf-8')
begin = output_string.find('Sample ID')
end = output_string.find('EPA ACTION LEVEL')
part_we_want = output_string[begin:end]
first_part = output_string[:begin]
print(first_part)
reader = csv.reader(StringIO(part_we_want))
with open('beasley.csv', 'w') as f:
writer = csv.writer(f)
writer.writerows(reader)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment