Monday, November 13, 2006

Extracting Images from MSWord Documents

A new post is up about how to extract images from word docs.
Comments may be left here.

Links to this post


At 22 November, 2006 17:36, Anonymous Anonymous said...

javax.xml.parsers.ParserConfigurationException: AElfred parser is non-validating
at com.icl.saxon.aelfred.SAXParserFactoryImpl.newSAXParser(
at com.doylecentral.word.BinaryExtractor.parseXml(
at com.doylecentral.word.FileTester.main(

At 22 November, 2006 18:19, Anonymous Anonymous said...

The above problem can be solved by removing saxon from the classpath.

At 23 May, 2007 04:37, Blogger CornĂ© said...

A nice but somewhat difficult way to get the images from word.
There is a much simpler way:
save your document as a html page. Word wil create a directory containing all images used in the document saved as png/gif/jpg

At 29 June, 2007 06:28, Anonymous Anonymous said...

Absolutely brilliant and soooo simple.



Post a Comment

Subscribe to Post Comments [Atom]

<< Home

Links to this post on: