Skip to content

Instantly share code, notes, and snippets.

@pstaender
Last active September 27, 2022 04:26
Show Gist options
  • Save pstaender/77efc869239493d3c602f843149126ef to your computer and use it in GitHub Desktop.
Save pstaender/77efc869239493d3c602f843149126ef to your computer and use it in GitHub Desktop.
OpenSource OCR for Mac OS
#!/bin/bash
# For Macs:
# brew install tesseract --with-all-languages
echo "Hint: Prepare all png images as valid OCR input before starting convert process"
if [[ "$1" != "" ]]; then
files="$1"
else
files=*.png
fi
if [[ "$2" != "" ]]; then
selected_language="-l eng+deu"
else
selected_language=""
fi
for f in $files
do
for f in $FILES
do
file_basename=$(basename $f .png)
echo "## CONVERTING $f > ${file_basename}.pdf"
tesseract $selected_language $f $file_basename pdf
done
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment