You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@poi.apache.org by Peter Brouwer <pe...@gmail.com> on 2009/03/26 10:03:18 UTC

Old text from word document shows up when I extract text

If I extract text from a word document (complex saved), I also get the text
which has been saved previously and is not valid anymore.

I use 3.2 FINAL and tried 3.5beta5 but both have the same problem.

I use:

HWPFDocument doc = new HWPFDocument(stream);
WordExtractor extractor = new WordExtractor(doc);
String text=extractor.getText();

Is there a way around this problem.

Thnx,
Peter