Skip to content

Instantly share code, notes, and snippets.

@webplumbr
Last active September 3, 2015 22:09
Show Gist options
  • Save webplumbr/64a150f595bcbac5b49c to your computer and use it in GitHub Desktop.
Save webplumbr/64a150f595bcbac5b49c to your computer and use it in GitHub Desktop.
How to make a searchable PDF
#
# This pertains to making a searchable PDF
#
# (1) Make sure your scanner setting is set to either Greyscale or Full-Colour and 200 dpi.
# (2) You need to have the following installed on your Ubuntu (Debian) system
#
sudo apt-get install imagemagick tesseract-ocr
# (3) Once you get the scanned PDF sent to your Univ e-mail address, download it and run the following
convert -normalize -density 200 -depth 8 <scanned.pdf> <newname.tif>
tesseract <newname.tif> <newpdfname> pdf
# (4) Open the newly created pdf and start searching
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment