Skip to content

Instantly share code, notes, and snippets.

Embed
What would you like to do?
benchmarks docx -> html conversation

AbiWord

abiword --to=html test.docx
  • test.docx : 21 pages
  • method : docx -> html
  • loop : 50
  • total time : 21.803488016129
  • avg. time : 0.43606976032257

wvWare (v 1.2.9)

$ apt-get install wvware



wvWare test.doc > test.html

wvWare cannot convert docx to html. So in this case a "doc" file is used.

  • test.doc : 11 pages
  • method : doc -> html
  • loop : 50
  • total time : 4.7186880111694
  • avg. time : 0.094373760223389

wvWare (v 2)

http://sourceforge.net/projects/wvware/

Warning: all of the utilities (with the exception of wvSummary) should be considered deprecated in favor of using AbiWord. wvWare the Microsoft Word DOC handling library is not deprecated, and is used heavily by (eg.) AbiWord.

wvHtml: convert your Word document into HTML4.0. Use "AbiWord --to=html" instead.

veryDoc (with wine under linux)

wine doc2any.exe test.docx ./test.html
  • test.docx : 21 pages
  • method : docx -> html
  • loop : 50
  • total time : 209.93248105049
  • avg. time : 4.1986496210098
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.