@IaroslavR
Last active May 31, 2023 15:10
Install the latest Tesseract on Amazon Linux
# aclocal ships with the automake package; it is not a separate yum package
sudo yum install autoconf automake
sudo yum install libtool
sudo yum install libjpeg-devel libpng-devel libtiff-devel zlib-devel
# create the working directory if it does not exist yet
mkdir -p ~/downloads && cd ~/downloads
wget http://www.leptonica.com/source/leptonica-1.72.tar.gz
tar -zxvf leptonica-1.72.tar.gz
cd leptonica-1.72
./configure
make
sudo make install
cd ..
wget https://github.com/tesseract-ocr/tesseract/archive/3.04.00.tar.gz
tar -zxvf 3.04.00.tar.gz
cd tesseract-3.04.00/
./autogen.sh
./configure
make
sudo make install
sudo ldconfig
cd /usr/local/share/tessdata
sudo wget http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.eng.tar.gz
sudo tar xvf tesseract-ocr-3.02.eng.tar.gz
sudo wget http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.01.osd.tar.gz
sudo tar xvf tesseract-ocr-3.01.osd.tar.gz
export TESSDATA_PREFIX=/usr/local/share/
sudo mv tesseract-ocr/tessdata/* .
sudo rm tesseract-ocr-3.02.eng.tar.gz
# we need osd for autorotate
sudo rm tesseract-ocr-3.01.osd.tar.gz
nano ~/.bash_profile
# Copy this line to the end: export TESSDATA_PREFIX=/usr/local/share/
# Verify:
tesseract --list-langs
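
The manual nano step above can also be scripted. The following is a minimal sketch, not part of the original gist: the function name `persist_tessdata_prefix` is hypothetical, and it assumes the profile file ends with a newline. It appends the export line only if the file does not already contain it, so it is safe to run more than once.

```shell
# Hypothetical helper: append the TESSDATA_PREFIX export to a profile file
# only when that exact line is not already present.
persist_tessdata_prefix() {
  profile="$1"
  line='export TESSDATA_PREFIX=/usr/local/share/'
  # Make sure the file exists so grep does not fail on a missing path.
  touch "$profile"
  # -x matches the whole line, -F treats the pattern as a fixed string.
  grep -qxF "$line" "$profile" || printf '%s\n' "$line" >> "$profile"
}
```

Calling `persist_tessdata_prefix ~/.bash_profile` and then opening a new login shell should have the same effect as editing the file by hand.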
@fpg1503

fpg1503 commented Jul 29, 2017

Great guide, except I had to export PKG_CONFIG_PATH=/usr/local/lib/pkgconfig before being able to ./configure tesseract, and I used curl instead of wget because I'm using Amazon Linux Minimal
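
A rough sketch of the workaround described in this comment, assuming Leptonica was installed under /usr/local as in the gist. The function name `add_pkg_config_dir` is hypothetical; it prepends the directory to PKG_CONFIG_PATH only if it is not already listed, so existing entries are kept.

```shell
# Hypothetical helper: prepend a directory to PKG_CONFIG_PATH unless it is
# already one of the colon-separated entries.
add_pkg_config_dir() {
  dir="$1"
  case ":${PKG_CONFIG_PATH:-}:" in
    *":$dir:"*) ;;  # already listed, nothing to do
    *) PKG_CONFIG_PATH="$dir${PKG_CONFIG_PATH:+:$PKG_CONFIG_PATH}" ;;
  esac
  export PKG_CONFIG_PATH
}

# Run this before ./configure in the tesseract source directory.
add_pkg_config_dir /usr/local/lib/pkgconfig
```

If pkg-config is installed, `pkg-config --modversion lept` should then print the Leptonica version (assuming the build installed its pkg-config file under the name `lept`), which is a quick check before running ./configure.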

@Ammar793

Great guide, thanks!

I had to run autoreconf --force --install before step 16 because I was getting a "Version mismatch error. This is libtool 2.4.6" error.

@Sudarshan-gurav

Sudarshan-gurav commented Oct 10, 2019

Tesseract still does not install on my Linux AMI. Please help. I am following these steps.

@IaroslavR
Author

@Sudarshan-gurav
There is a better option now: a Docker container. See https://github.com/tesseract-ocr/tesseract/wiki/4.0-Docker-Containers for more details.

@ricardonsantos

My Amazon Linux machine didn't have a C++ compiler by default, so it could not compile tesseract. Solved by installing the additional packages available in AWS:

sudo yum groupinstall "Development Tools"
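
To see up front whether the build tools are present, something like the following can be run before ./configure. This is only a sketch: the function name `missing_tools` is hypothetical, and the list of tools is an assumption based on what the gist's commands invoke.

```shell
# Hypothetical helper: print each named tool that cannot be found on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Tool list is an assumption; aclocal is provided by the automake package.
missing=$(missing_tools gcc g++ make autoconf automake aclocal libtool)
if [ -n "$missing" ]; then
  echo "Missing build tools:"
  echo "$missing"
fi
```

If g++ shows up in the output, installing the "Development Tools" group as described above is one way to get it.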

@KrutarthPujara

KrutarthPujara commented Apr 8, 2020

@yasergsi

On Amazon Linux 2 I get: "No package aclocal available."
