You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2020/03/24 18:46:03 UTC

[Bug 64261] New: Parse Errors for application/vnd.ms-excel

https://bz.apache.org/bugzilla/show_bug.cgi?id=64261

            Bug ID: 64261
           Summary: Parse Errors for application/vnd.ms-excel
           Product: POI
           Version: unspecified
          Hardware: All
                OS: All
            Status: NEW
          Severity: major
          Priority: P2
         Component: HSSF
          Assignee: dev@poi.apache.org
          Reporter: jaavni@gmail.com
  Target Milestone: ---

Created attachment 37119
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=37119&action=edit
Example test_dropbox_selected.xls

We are trying to extract content from old Excel files using TIKA and we have
encountered this error. If the excel file has a dropbox WITH any element
selected, Apache Tika returns this exception, but if we deselect the element
and save it, Tika extracts the content without any problem:

Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected
RuntimeException from
org.apache.tika.parser.microsoft.OfficeParser@37ddb69aException in thread
"main" org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.microsoft.OfficeParser@37ddb69a at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:282) at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) at
testTika.ExtractContent(testTika.java:183) at
testTika.main(testTika.java:170)Caused by:
org.apache.poi.util.RecordFormatException: Leftover 7 bytes in subrecord data
[15, 00, 12, 00, 12, 00, 01, 00, 11, 20, 00, 00, 00, 00, 00, 00, 00, 00, 00,
00, 00, 00, 0C, 00, 14, 00, 00, 00, 00, 00, 00, 00, 00, 00, 01, 00, 01, 00, 06,
00, 00, 00, 10, 00, 01, 00, 13, 00, EE, 1F, 10, 00, 09, 00, 00, 00, 00, 00, 25,
04, 00, 0A, 00, 05, 00, 05, 00, 05, 07, 00, 00, 00, 18, 00, 00, 00, 00, 00, 00,
01, 00, 00, 00] at
org.apache.poi.hssf.record.ObjRecord.<init>(ObjRecord.java:112) at
java.base/jdk.internal.reflect.NativeConstructorAccessorImpl.newInstance0(Native
Method) at
java.base/jdk.internal.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)

I've attached 2 documents to test.

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 64261] Parse Errors for application/vnd.ms-excel

Posted by bu...@apache.org.
https://bz.apache.org/bugzilla/show_bug.cgi?id=64261

--- Comment #1 from Javier <ja...@gmail.com> ---
Created attachment 37120
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=37120&action=edit
Example test_dropbox_NO_selected.xls

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 64261] Parse Errors for application/vnd.ms-excel

Posted by bu...@apache.org.
https://bz.apache.org/bugzilla/show_bug.cgi?id=64261

Dominik Stadler <do...@gmx.at> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |FIXED

--- Comment #4 from Dominik Stadler <do...@gmx.at> ---
As far as I see this was actually fixed some time ago via r1875956.

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 64261] Parse Errors for application/vnd.ms-excel

Posted by bu...@apache.org.
https://bz.apache.org/bugzilla/show_bug.cgi?id=64261

--- Comment #3 from Tim Allison <ta...@mitre.org> ---
Found the problem.  Will fix shortly.

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 64261] Parse Errors for application/vnd.ms-excel

Posted by bu...@apache.org.
https://bz.apache.org/bugzilla/show_bug.cgi?id=64261

--- Comment #2 from Javier <ja...@gmail.com> ---
https://issues.apache.org/jira/browse/TIKA-3076

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


[Bug 64261] Parse Errors for application/vnd.ms-excel

Posted by bu...@apache.org.
https://bz.apache.org/bugzilla/show_bug.cgi?id=64261

Javier <ja...@gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jaavni@gmail.com

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org