Skip to content

Instantly share code, notes, and snippets.

@jmandel
Created February 1, 2016 22:04
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save jmandel/6b66b3c31692dde58c5e to your computer and use it in GitHub Desktop.
Save jmandel/6b66b3c31692dde58c5e to your computer and use it in GitHub Desktop.
Download 2016 EHR Certification PDFs and index for search
wget -e robots=off --mirror -l 1 http://confluence.oncprojectracking.org/display/ONCCERT2015/ONC+Health+IT+Certification+Program+2015+Edition+Test+Methods+Home
grep -ri final_rule/pdfs confluence.oncprojectracking.org/ | grep -oP '(https.*?)\.pdf' > pdfs
for i in $(cat pdfs); do wget $i; done
for i in *.pdf; do pdftotext "$i"; done
grep API *.txt
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment