You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@poi.apache.org by Sam Li <sa...@gmail.com> on 2012/04/03 09:04:37 UTC

extracting text INCLUDING textbox from docx, xlsx, etc formats

I'm currently unable to extract all the text from the office 2007 office xml formats; namely textboxes. What I really need is just a word count but the word counter isn't very accurate. Any ideas on how to solve this problem? I know that the regular .doc files that contain textboxes can be extracted fine. Just having trouble with the x files. 
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org