@chrisjz
Forked from IaroslavR/gist:834066ba4c0e25a27078
Last active October 12, 2017 05:54
Install Tesseract 3.04.00 from source on Amazon Linux
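# aclocal ships with automake, so it is not a separate yum package
sudo yum install autoconf automake
# compiler toolchain, in case it is not already installed
sudo yum install gcc gcc-c++ make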
sudo yum install libtool
sudo yum install libjpeg-devel libpng-devel libtiff-devel zlib-devel
mkdir -p ~/downloads && cd ~/downloads
wget http://www.leptonica.com/source/leptonica-1.72.tar.gz
tar -zxvf leptonica-1.72.tar.gz
cd leptonica-1.72
./configure
make
sudo make install
cd ..
wget https://github.com/tesseract-ocr/tesseract/archive/3.04.00.tar.gz
tar -zxvf 3.04.00.tar.gz
cd tesseract-3.04.00/
./autogen.sh
./configure
make
sudo make install
sudo ldconfig
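On some RHEL-family systems /usr/local/lib may not be in the dynamic linker's default search path, in which case `sudo ldconfig` alone will not register the freshly built libraries. The sketch below is one way to check. `missing_libs` is a hypothetical helper (not part of Tesseract or Leptonica), and the `usr-local.conf` file name in the comments is an arbitrary choice.

```shell
# missing_libs: reads `ldconfig -p` output on stdin and prints each expected
# library prefix that does not appear in it. Prints nothing if both are cached.
missing_libs() {
    cache=$(cat)
    for lib in liblept libtesseract; do
        case "$cache" in
            *"$lib.so"*) ;;        # found in the linker cache
            *) echo "$lib" ;;      # not found
        esac
    done
}

# Usage sketch (commented out so that sourcing this block changes nothing):
# ldconfig -p | missing_libs
# If anything is printed, one possible fix, assuming the libraries were
# installed to /usr/local/lib, is:
# echo '/usr/local/lib' | sudo tee /etc/ld.so.conf.d/usr-local.conf
# sudo ldconfig
```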
cd /usr/local/share/tessdata
sudo wget -O tesseract-ocr-3.02.eng.tar.gz https://sourceforge.net/projects/tesseract-ocr-alt/files/tesseract-ocr-3.02.eng.tar.gz/download
sudo tar xvf tesseract-ocr-3.02.eng.tar.gz
sudo wget -O tesseract-ocr-3.01.osd.tar.gz https://sourceforge.net/projects/tesseract-ocr-alt/files/tesseract-ocr-3.01.osd.tar.gz/download
sudo tar xvf tesseract-ocr-3.01.osd.tar.gz
export TESSDATA_PREFIX=/usr/local/share/
sudo mv tesseract-ocr/tessdata/* .
sudo rm tesseract-ocr-3.02.eng.tar.gz
# the extracted osd data stays in place (needed for auto-rotation); only the archive is deleted
sudo rm tesseract-ocr-3.01.osd.tar.gz
# Persist TESSDATA_PREFIX for future logins by appending it to ~/.bash_profile
# (no sudo: the file belongs to the current user, and sudo would not apply to the >> redirect anyway)
echo 'export TESSDATA_PREFIX=/usr/local/share/' >> ~/.bash_profile
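Re-running these steps appends the same export line to ~/.bash_profile again each time. A minimal sketch of an idempotent append follows. `append_once` is a hypothetical helper name, not something provided by the shell or by Tesseract.

```shell
# append_once LINE FILE: append LINE to FILE unless an identical line exists.
append_once() {
    line=$1
    file=$2
    # -x: match the whole line, -F: fixed string (no regex), -q: no output
    if ! grep -qxF -- "$line" "$file" 2>/dev/null; then
        printf '%s\n' "$line" >> "$file"
    fi
}

# Usage sketch (commented out so that sourcing this block changes nothing):
# append_once 'export TESSDATA_PREFIX=/usr/local/share/' "$HOME/.bash_profile"
```

Calling it twice with the same arguments leaves a single copy of the line in the file.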
# Verify:
tesseract --list-langs
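The listing can also be checked mechanically for the two data sets installed above (`eng` and `osd`). This is only a sketch: `has_lang` and `check_langs` are hypothetical helper names, and stderr is merged into stdout in case the build prints the listing there.

```shell
# has_lang LANG: succeeds if LANG appears as a whole line on standard input.
has_lang() {
    grep -qxF -- "$1"
}

# check_langs: report whether eng and osd show up in `tesseract --list-langs`.
check_langs() {
    # Merge stderr into stdout in case the listing is printed on stderr.
    listing=$(tesseract --list-langs 2>&1)
    for lang in eng osd; do
        if printf '%s\n' "$listing" | has_lang "$lang"; then
            echo "$lang: found"
        else
            echo "$lang: missing"
        fi
    done
}

# Usage sketch (commented out so that sourcing this block runs nothing):
# check_langs
```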