You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2008/08/05 16:49:11 UTC

DO NOT REPLY [Bug 45565] New: poi-3.5-beta1-20080718.jar - content from a TextBox object of a 2003 xls document is not extracted.

https://issues.apache.org/bugzilla/show_bug.cgi?id=45565

           Summary: poi-3.5-beta1-20080718.jar - content from a TextBox
                    object of a 2003 xls document is not extracted.
           Product: POI
           Version: unspecified
          Platform: PC
        OS/Version: Windows Server 2003
            Status: NEW
          Severity: normal
          Priority: P2
         Component: POI Overall
        AssignedTo: dev@poi.apache.org
        ReportedBy: xtrimxtrim@yahoo.fr


Created an attachment (id=22388)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=22388)
Contains JUnit test class and documents used for testing.

The text contained in a TextBox inserted/created in an excel 2003 document is
not extracted.
Find in attachments the JUnit test class and the document used for testing.
We expected to extract the words "testdoc" and "test phrase".

Notes on the attached documents:

- the document "classic.TextInTextBox.xls" contains the words "testdoc" and
"test phrase" in a TextBox inserted in the document.


"TestUnitPoi35Filter.java" is the JUnit class.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 45565] poi-3.5-beta1-20080718.jar - content from a TextBox object of a 2003 xls document is not extracted.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45565

--- Comment #3 from MaryAubaun@gmail.com 2012-04-20 22:39:45 UTC ---
I get the same problem with the event based parsers, for both the 97-2003
formats and the 2007/xslx formats.  If anyone can give an idea what code to
add, I may be able to put it in, at least into the event-based one, and post
the code.

Also would like to get hidden text and revision marks, as settable options, and
can write the code for it if someone can point me in the right direction.

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 45565] poi-3.5-beta1-20080718.jar - content from a TextBox object of a 2003 xls document is not extracted.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45565

Dominik Stadler <do...@gmx.at> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
          Component|POI Overall                 |XSSF

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 45565] poi-3.5-beta1-20080718.jar - content from a TextBox object of a 2003 xls document is not extracted.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45565





--- Comment #2 from Jose M. Sánchez <jm...@autentia.com>  2009-01-29 04:31:05 PST ---
With 3.2-FINAL to 3.5-beta1 versions also not extracts the contents of the text
boxes in word 97 documents.

As in the previous comment, we have uploaded a JUnit test, that reproduces the
error with WordExtractor and the ExtractorFactory.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 45565] poi-3.5-beta1-20080718.jar - content from a TextBox object of a 2003 xls document is not extracted.

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=45565





--- Comment #1 from Jose M. Sánchez <jm...@autentia.com>  2009-01-29 04:29:55 PST ---
Created an attachment (id=23191)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=23191)
Contains JUnit test class and a document used for testing. 


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org