Skip to content

Instantly share code, notes, and snippets.

@hasantayyar
Last active January 4, 2016 03:09
Show Gist options
  • Save hasantayyar/8559594 to your computer and use it in GitHub Desktop.
Save hasantayyar/8559594 to your computer and use it in GitHub Desktop.
benchmarks docx -> html conversation

AbiWord

abiword --to=html test.docx
  • test.docx : 21 pages
  • method : docx -> html
  • loop : 50
  • total time : 21.803488016129
  • avg. time : 0.43606976032257

wvWare (v 1.2.9)

$ apt-get install wvware



wvWare test.doc > test.html

wvWare cannot convert docx to html. So in this case a "doc" file is used.

  • test.doc : 11 pages
  • method : doc -> html
  • loop : 50
  • total time : 4.7186880111694
  • avg. time : 0.094373760223389

wvWare (v 2)

http://sourceforge.net/projects/wvware/

Warning: all of the utilities (with the exception of wvSummary) should be considered deprecated in favor of using AbiWord. wvWare the Microsoft Word DOC handling library is not deprecated, and is used heavily by (eg.) AbiWord.

wvHtml: convert your Word document into HTML4.0. Use "AbiWord --to=html" instead.

veryDoc (with wine under linux)

wine doc2any.exe test.docx ./test.html
  • test.docx : 21 pages
  • method : docx -> html
  • loop : 50
  • total time : 209.93248105049
  • avg. time : 4.1986496210098
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment