Created
December 27, 2019 08:46
-
-
Save blackawa/075e4b89e1245b27cdae959768be3826 to your computer and use it in GitHub Desktop.
pdfbox.clj
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
(ns main | |
(:require [clojure.java.io :as io]) | |
(:import [org.apache.pdfbox.pdmodel PDDocument] | |
[org.apache.pdfbox.text PDFTextStripper])) | |
(defn extract-string [src] | |
(with-open [doc (->> src PDDocument/load)] | |
(->> doc (.getText (PDFTextStripper.))))) |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment