Created December 14, 2021 22:48
A shell script that tries to remove exploits and malware from PDFs
# References:
TEMPFILE=$(mktemp /tmp/pdfsanitize.XXXXXXXXX)
# Rewrite the PDF and store all images uncompressed to remove image metadata (EXIF)
gs -sDEVICE=pdfwrite -dColorConversionStrategy=/LeaveColorUnchanged -dPassThroughJPEGImages=false -dPassThroughJPXImages=false -dEncodeColorImages=false -dEncodeGrayImages=false -dEncodeMonoImages=false -dNOPAUSE -dBATCH -sOutputFile="$TEMPFILE" "$1"
# Re-compress images and downgrade PDF version to destroy (hopefully) all malware and exploits
gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook -dNOPAUSE -dBATCH -sOutputFile="$OUTFILE" "$TEMPFILE"
# Clean up
rm -f "$TEMPFILE"
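
The script as shown never sets OUTFILE and does not check its arguments. The following is a minimal sketch of how the two Ghostscript passes might be wrapped so that the output path is defined and the temporary file is always removed. The function names `default_outfile` and `sanitize_pdf`, the `.sanitized.pdf` naming scheme, and the optional second argument are assumptions made for illustration, not part of the original gist. The Ghostscript flags are copied unchanged from the commands above.

```shell
#!/bin/sh
# Sketch only: default_outfile and sanitize_pdf are hypothetical helpers,
# not part of the original gist.

# Derive an output name from the input name (assumed naming scheme):
#   report.pdf -> report.sanitized.pdf
default_outfile() {
    printf '%s.sanitized.pdf\n' "${1%.pdf}"
}

# Usage: sanitize_pdf INPUT [OUTPUT]
# The body runs in a subshell, so the EXIT trap and the variables
# do not leak into the calling shell.
sanitize_pdf() (
    [ "$#" -ge 1 ] || { echo "usage: sanitize_pdf INPUT [OUTPUT]" >&2; exit 2; }
    [ -r "$1" ] || { echo "cannot read: $1" >&2; exit 1; }

    infile=$1
    outfile=${2:-$(default_outfile "$infile")}

    tempfile=$(mktemp /tmp/pdfsanitize.XXXXXXXXX) || exit 1
    # Remove the temporary file however the subshell exits.
    trap 'rm -f "$tempfile"' EXIT

    # Pass 1: rewrite the PDF with images stored uncompressed.
    gs -sDEVICE=pdfwrite -dColorConversionStrategy=/LeaveColorUnchanged \
       -dPassThroughJPEGImages=false -dPassThroughJPXImages=false \
       -dEncodeColorImages=false -dEncodeGrayImages=false \
       -dEncodeMonoImages=false -dNOPAUSE -dBATCH \
       -sOutputFile="$tempfile" "$infile" || exit 1

    # Pass 2: re-compress images and write a PDF 1.4 file.
    gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/ebook \
       -dNOPAUSE -dBATCH -sOutputFile="$outfile" "$tempfile"
)
```

After sourcing the file, `sanitize_pdf suspicious.pdf clean.pdf` would write the result to `clean.pdf`; with only one argument the output name is derived from the input. Using a trap instead of a trailing `rm` means the temporary file is also removed when the first Ghostscript pass fails.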