Skip to content

Instantly share code, notes, and snippets.

@elektret
Last active December 27, 2015 06:09
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save elektret/7280051 to your computer and use it in GitHub Desktop.
Save elektret/7280051 to your computer and use it in GitHub Desktop.
#!/bin/sh
# You need: cuneiform hocr2pdf pdfunite
FILES=$@
merged=""
index=0
for file in $FILES
do
index=`expr $index + 1`
page=`printf page%03d.hocr $index`
cuneiform -l ger -f hocr -o "${page}" "${file}"
done
index=0
for file in $FILES
do
index=`expr $index + 1`
page=`printf page%03d.hocr $index`
output=`printf out%03d.pdf $index`
merged="${merged} ${output}"
hocr2pdf -i "${file}" -o "${output}" < $page
done
pdfunite $merged merged.pdf
rm -rf *.hocr
rm -rf out*.pdf
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment