Skip to content

Instantly share code, notes, and snippets.

@giansalex
Forked from IaroslavR/gist:834066ba4c0e25a27078
Last active May 17, 2020 00:18
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save giansalex/7abc8514fa8c80a3771f56283fd41fd7 to your computer and use it in GitHub Desktop.
Save giansalex/7abc8514fa8c80a3771f56283fd41fd7 to your computer and use it in GitHub Desktop.
Install last tesseract to Amazon Linux from scripts
sudo yum install -y autoconf aclocal automake
sudo yum install -y libtool
sudo yum install -y libjpeg-devel libpng-devel libtiff-devel zlib-devel
cd ~/downloads
wget http://www.leptonica.com/source/leptonica-1.72.tar.gz
tar -zxvf leptonica-1.72.tar.gz
cd leptonica-1.72
./configure
make
sudo make install
cd ..
wget https://github.com/tesseract-ocr/tesseract/archive/3.04.00.tar.gz
tar -zxvf 3.04.00.tar.gz
cd tesseract-3.04.00/
./autogen.sh
./configure
make
sudo make install
sudo ldconfig
cd /usr/local/share/tessdata
sudo wget -O tesseract-ocr-3.02.eng.tar.gz https://src.fedoraproject.org/repo/pkgs/tesseract/tesseract-ocr-3.02.eng.tar.gz/3562250fe6f4e76229a329166b8ae853/tesseract-ocr-3.02.eng.tar.gz
sudo tar xvf tesseract-ocr-3.02.eng.tar.gz
sudo wget -O tesseract-ocr-3.01.osd.tar.gz https://src.fedoraproject.org/repo/pkgs/tesseract/tesseract-ocr-3.01.osd.tar.gz/683486e01f5b87c17f2f5815f770ccb3/tesseract-ocr-3.01.osd.tar.gz
sudo tar xvf tesseract-ocr-3.01.osd.tar.gz
export TESSDATA_PREFIX=/usr/local/share/
sudo mv tesseract-ocr/tessdata/* .
sudo rm tesseract-ocr-3.02.eng.tar.gz
# we need osd for autorotate
sudo rm tesseract-ocr-3.01.osd.tar.gz
echo "export TESSDATA_PREFIX=/usr/local/share/" >> ~/.bash_profile
# Verify:
tesseract --list-langs
@giansalex
Copy link
Author

wget -O install.sh https://gist.githubusercontent.com/giansalex/7abc8514fa8c80a3771f56283fd41fd7/raw
chmod +x install.sh 
./install.sh

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment