You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@poi.apache.org by Nick Burch <ni...@torchbox.com> on 2008/04/14 12:23:45 UTC

Re: Processing word documents recursively

On Thu, 27 Mar 2008, Yury Batrakov wrote:
> I've started implementing a feature to process embedded OLE2 documents. 
> I've slightly modified HWPFDocument(POIFSFileSystem pfilesystem) to 
> accept OLE directory name to look all desired streams there.

I've put something similar to this into trunk now. It works for 
HSSFWorkbook, HWPFDocument and HSLFSlideShow, and there's some information 
on the site under /poifs/.

I've also written some new unit tests for the handling of embeded 
documents, which all pass, so hopefully you'll be able to use this to read 
your embeded documents without any problems

Nick

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org


Re: Processing word documents recursively

Posted by Nick Burch <ni...@torchbox.com>.
On Mon, 14 Apr 2008, Yury Batrakov wrote:
> Oops. I've almost prepared all the patches and already imagined myself
> written down in history of POI, but you finished it before :))

Sorry about that... Someone at apachecon asked me about support for it, 
so I coded it up between two sessions for them!

It's worth checking a recent svn checkout / nightly, to check I did add in 
all the support you had, and your code works nicely with it. If not, do 
please submit patches / bug reports :)

Nick

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org


Re: Processing word documents recursively

Posted by Yury Batrakov <ba...@gmail.com>.
Oops. I've almost prepared all the patches and already imagined myself
written down in history of POI, but you finished it before :)) Waiting
for 3.0.3

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org


Re: Processing word documents recursively

Posted by amp834 <am...@rqinc.com>.
I'm looking for the code mentioned below (I have downloaded poi-3.6).  Has it
made it there?  If not, is there a way I can get it?

I would like to save all embedded files (or "attachments") that are
contained in a .doc file.  (Also in .xls and other office files also, if
there are simple uniform ways of doing so).

Also, some binary files have the "ole package" wrapper, I have a hack from
an old post to unwrap them (haven't tested it yet), but if there is a
routine or library for it, it would make it easier!



On Thu, 27 Mar 2008, Yury Batrakov wrote:
> I've started implementing a feature to process embedded OLE2 documents. 
> I've slightly modified HWPFDocument(POIFSFileSystem pfilesystem) to 
> accept OLE directory name to look all desired streams there.

I've put something similar to this into trunk now. It works for 
HSSFWorkbook, HWPFDocument and HSLFSlideShow, and there's some information 
on the site under /poifs/.

I've also written some new unit tests for the handling of embeded 
documents, which all pass, so hopefully you'll be able to use this to read 
your embeded documents without any problems

Nick

-- 
View this message in context: http://old.nabble.com/Processing-word-documents-recursively-tp15806930p27614853.html
Sent from the POI - User mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@poi.apache.org
For additional commands, e-mail: user-help@poi.apache.org