Skip to content

Instantly share code, notes, and snippets.

@akashjobanputra
Forked from IaroslavR/gist:834066ba4c0e25a27078
Last active July 3, 2018 15:56
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save akashjobanputra/2c54acb04e75d3967abd7279303ffa8a to your computer and use it in GitHub Desktop.
Save akashjobanputra/2c54acb04e75d3967abd7279303ffa8a to your computer and use it in GitHub Desktop.
Install Tesseract 3.04 on Amazon AWS AMI
sudo yum install autoconf aclocal automake
sudo yum install libtool
sudo yum install libjpeg-devel libpng-devel libtiff-devel zlib-devel
cd ~/downloads
wget http://www.leptonica.com/source/leptonica-1.72.tar.gz
tar -zxvf leptonica-1.72.tar.gz
cd leptonica-1.72
./configure
make
sudo make install
cd ..
wget https://github.com/tesseract-ocr/tesseract/archive/3.04.00.tar.gz
tar -zxvf 3.04.00.tar.gz
cd tesseract-3.04.00/
./autogen.sh
./configure
make
sudo make install
sudo ldconfig
cd /usr/local/share/tessdata
sudo wget https://github.com/tesseract-ocr/tessdata/raw/3.04.00/eng.traineddata
sudo wget https://github.com/tesseract-ocr/tessdata/raw/3.04.00/osd.traineddata
nano ~/.bash_profile
# Copy this line to the end: export TESSDATA_PREFIX=/usr/local/share/
# Update variables:
source ~/.bash_profile
# Verify:
tesseract --list-langs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment