You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/10/17 10:46:58 UTC
[jira] [Reopened] (TIKA-2120) NegativeArraySizeException on a
password protected Excel workbook
[ https://issues.apache.org/jira/browse/TIKA-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison reopened TIKA-2120:
-------------------------------
I found five triggering xls files in our most recent [regression reports|https://github.com/tballison/share/blob/master/tika_comparisons/reports_1_14-trunk_vs_1_13.zip?raw=true].
{noformat}
java.lang.NegativeArraySizeException
at org.apache.poi.ddf.EscherBSERecord.fillFields(EscherBSERecord.java:84)
at org.apache.poi.ddf.EscherContainerRecord.fillFields(EscherContainerRecord.java:77)
at org.apache.poi.ddf.EscherContainerRecord.fillFields(EscherContainerRecord.java:77)
at org.apache.poi.hssf.record.AbstractEscherHolderRecord.convertToEscherRecords(AbstractEscherHolderRecord.java:78)
at org.apache.poi.hssf.record.AbstractEscherHolderRecord.decode(AbstractEscherHolderRecord.java:261)
at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile(ExcelExtractor.java:323)
at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtractor.java:169)
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:177)
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:130)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
{noformat}
If this looks familiar, I'll attach an example file.
> NegativeArraySizeException on a password protected Excel workbook
> -----------------------------------------------------------------
>
> Key: TIKA-2120
> URL: https://issues.apache.org/jira/browse/TIKA-2120
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.13
> Reporter: Seva Alekseyev
>
> On the following password protected Excel file
> https://dl.dropboxusercontent.com/u/92341073/20090906%20real%20inventory.xls
> The Tika parser throws NegativeArraySizeException.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)