You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2008/08/04 16:26:25 UTC

DO NOT REPLY [Bug 45535] New: poi-3.5-beta1-20080718.jar - content from an embedded document of an Office 2003 doc document is not extracted.

https://issues.apache.org/bugzilla/show_bug.cgi?id=45535

           Summary: poi-3.5-beta1-20080718.jar - content from an embedded
                    document of an Office 2003 doc document is not
                    extracted.
           Product: POI
           Version: unspecified
          Platform: PC
        OS/Version: Windows Server 2003
            Status: NEW
          Severity: normal
          Priority: P2
         Component: POI Overall
        AssignedTo: dev@poi.apache.org
        ReportedBy: bruno.cartier-millon@laposte.net


Created an attachment (id=22358)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=22358)
Contains JUnit test class and documents used for testing.

When an office 2003 word document contains an embedded document, the content of
this embedded document is not extracted.
Find in attachments the JUnit test class and the documents used for testing.

We expected to extract the words "testdoc" and "test phrase".


notes on the attached document:

- the document "classic_EmbeddedObject_excel.doc" contains an xls embedded
document that contains the words "testdoc" and "test phrase"

- the document "classic_EmbeddedObject_ppt.doc" contains a ppt embedded
document that contains the words "testdoc" and "test phrase"

- the document "classic_EmbeddedObject_word.doc" contains a doc embedded
document that contains the words "testdoc" and "test phrase"

"TestUnitPoi35Filter.java" is the JUnit class.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 45535] poi-3.5-beta1-20080718.jar - content from an embedded document of an Office 2003 doc document is not extracted.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45535


Nick Burch <ni...@torchbox.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |WONTFIX




--- Comment #1 from Nick Burch <ni...@torchbox.com>  2008-08-04 07:33:07 PST ---
You'll need to pass the embeded OLE2 documents off to the appropriate h##f code
to process them.

POI doesn't automatically re-curse into embeded documents.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 45535] poi-3.5-beta1-20080718.jar - content from an embedded document of an Office 2003 doc document is not extracted.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45535





--- Comment #2 from xtrim <br...@laposte.net>  2008-08-04 07:43:21 PST ---
Created an attachment (id=22359)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=22359)
The attachments contains new examples of documents used for testing.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org