Skip to content

Instantly share code, notes, and snippets.

@richiefrost
Created July 24, 2019 08:02
Show Gist options
  • Save richiefrost/62d84d86addb147b8b8e6ca56d4703ae to your computer and use it in GitHub Desktop.
Save richiefrost/62d84d86addb147b8b8e6ca56d4703ae to your computer and use it in GitHub Desktop.
Dockerfile for creating a textract Docker image
FROM ubuntu:latest
RUN apt-get update && \
apt-get install python-dev libxml2-dev libxslt1-dev antiword unrtf poppler-utils pstotext tesseract-ocr \
flac ffmpeg lame libmad0 libsox-fmt-mp3 sox libjpeg-dev swig libpulse-dev python-pip -y && \
pip install textract
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment