You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2008/08/04 16:26:25 UTC
DO NOT REPLY [Bug 45535] New: poi-3.5-beta1-20080718.jar - content
from an embedded document of an Office 2003 doc document is not extracted.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45535
Summary: poi-3.5-beta1-20080718.jar - content from an embedded
document of an Office 2003 doc document is not
extracted.
Product: POI
Version: unspecified
Platform: PC
OS/Version: Windows Server 2003
Status: NEW
Severity: normal
Priority: P2
Component: POI Overall
AssignedTo: dev@poi.apache.org
ReportedBy: bruno.cartier-millon@laposte.net
Created an attachment (id=22358)
--> (https://issues.apache.org/bugzilla/attachment.cgi?id=22358)
Contains JUnit test class and documents used for testing.
When an office 2003 word document contains an embedded document, the content of
this embedded document is not extracted.
Find in attachments the JUnit test class and the documents used for testing.
We expected to extract the words "testdoc" and "test phrase".
notes on the attached document:
- the document "classic_EmbeddedObject_excel.doc" contains an xls embedded
document that contains the words "testdoc" and "test phrase"
- the document "classic_EmbeddedObject_ppt.doc" contains a ppt embedded
document that contains the words "testdoc" and "test phrase"
- the document "classic_EmbeddedObject_word.doc" contains a doc embedded
document that contains the words "testdoc" and "test phrase"
"TestUnitPoi35Filter.java" is the JUnit class.
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org
DO NOT REPLY [Bug 45535] poi-3.5-beta1-20080718.jar - content from
an embedded document of an Office 2003 doc document is not extracted.
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45535
Nick Burch <ni...@torchbox.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |RESOLVED
Resolution| |WONTFIX
--- Comment #1 from Nick Burch <ni...@torchbox.com> 2008-08-04 07:33:07 PST ---
You'll need to pass the embeded OLE2 documents off to the appropriate h##f code
to process them.
POI doesn't automatically re-curse into embeded documents.
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org
DO NOT REPLY [Bug 45535] poi-3.5-beta1-20080718.jar - content from
an embedded document of an Office 2003 doc document is not extracted.
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45535
--- Comment #2 from xtrim <br...@laposte.net> 2008-08-04 07:43:21 PST ---
Created an attachment (id=22359)
--> (https://issues.apache.org/bugzilla/attachment.cgi?id=22359)
The attachments contains new examples of documents used for testing.
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org