@ryoakg
Last active April 20, 2016 14:12
Reading RTF with Tika. I couldn't get the text color or size, unfortunately.
;;; Start with `boot repl`
(set-env! :dependencies '[[org.apache.tika/tika-parsers "1.10"]])

(import '(org.apache.tika metadata.Metadata
                          parser.ParseContext
                          parser.rtf.RTFParser)
        '(java.io StringWriter FileInputStream)
        '(javax.xml.transform sax.SAXTransformerFactory
                              stream.StreamResult
                              OutputKeys))

(let [metadata (Metadata.)
      sw       (StringWriter.)
      factory  (cast SAXTransformerFactory (SAXTransformerFactory/newInstance))
      ;; SAX handler that serializes the parser's output events into sw
      handler  (doto (.newTransformerHandler factory)
                 (.. getTransformer (setOutputProperty OutputKeys/METHOD "xml"))
                 (.. getTransformer (setOutputProperty OutputKeys/INDENT "no"))
                 (.setResult (StreamResult. sw)))]
  ;; with-open closes the input stream once parsing is done
  (with-open [in (FileInputStream. "1.rtf")]
    (.parse (RTFParser.) in handler metadata (ParseContext.)))
  (-> sw .toString prn))
ryoakg commented Oct 14, 2015

It looks like the output can be configured through OutputKeys:
http://docs.oracle.com/javase/7/docs/api/javax/xml/transform/OutputKeys.html
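
To get a feel for what the OutputKeys settings do, here is a minimal sketch that needs only the JDK, not Tika. The `serialize` helper and the `sample` string are made up for illustration; they just run an identity transform over a small XML string with whatever output properties are passed in.

```clojure
(import '(java.io StringReader StringWriter)
        '(javax.xml.transform TransformerFactory OutputKeys)
        '(javax.xml.transform.stream StreamSource StreamResult))

;; Hypothetical helper, not part of the gist above.
(defn serialize
  "Runs an identity transform over xml-string with the given output
  properties (a map of OutputKeys name -> value) and returns the result."
  [xml-string props]
  (let [transformer (.newTransformer (TransformerFactory/newInstance))
        sw          (StringWriter.)]
    (doseq [[k v] props]
      (.setOutputProperty transformer k v))
    (.transform transformer
                (StreamSource. (StringReader. xml-string))
                (StreamResult. sw))
    (str sw)))

;; Made-up input that stands in for the markup a parser might emit.
(def sample "<html><body><p>Hello</p><p>RTF</p></body></html>")

;; METHOD "text" should drop the markup and keep only the text nodes.
(prn (serialize sample {OutputKeys/METHOD "text"}))

;; OMIT_XML_DECLARATION "yes" should leave out the <?xml ...?> header.
(prn (serialize sample {OutputKeys/METHOD "xml"
                        OutputKeys/OMIT_XML_DECLARATION "yes"}))
```

The same property names should work on the transformer obtained via `getTransformer` in the snippet above, since both are plain `javax.xml.transform.Transformer` instances.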

Getting colors and the like seems to be impossible.
