henrik/ocr.markdown

## ocr.markdown

      
    Raw
  

              ocr.markdown
            
          
    Install ImageMagick for image conversion:
brew install imagemagick

Install tesseract for OCR:
brew install tesseract --all-languages

Or install without --all-languages and install them manually as needed.
Make sure the input image is a grayscale .tif and fairly large. ~500x150 was too small, while ~2000*500 worked very well.
convert input.png -resize 400% -type Grayscale input.tif

OCR it. The default language is English. Language codes are 3 chars per man tesseract.
tesseract -l eng input.tif output

This creates output.txt.