- GraphicsMagick
- poppler-utils
- open office -- will this work with Open Office on server?
gem 'docsplit'
a = Docsplit.extract_text '/path/to/file' #creates a pdf at an obscure (temp?) file location
content = `pdftotext #{a.first} -` #dash sends results to stdout
- Possibly need a script to remove pdfs generated by this process
- seems like a lot of overhead to grab text from a ppt . . . but it seems to work