You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Vinay Kawade (JIRA)" <ji...@apache.org> on 2018/01/09 19:38:00 UTC
[jira] [Updated] (TIKA-2196) IllegalArgumentException on a valid
Excel file
[ https://issues.apache.org/jira/browse/TIKA-2196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vinay Kawade updated TIKA-2196:
-------------------------------
Attachment: 1.xls
Sample file with only one sheet and 2 cells populated for testing.
> IllegalArgumentException on a valid Excel file
> ----------------------------------------------
>
> Key: TIKA-2196
> URL: https://issues.apache.org/jira/browse/TIKA-2196
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.14
> Environment: Windows 7 x64, JVM 1.8.0_101
> Reporter: Seva Alekseyev
> Attachments: 1.xls, 2007 Experiment watch.xls
>
>
> On the attached Excel file, which opens fine in Excel, Tika throws the following error:
> java.lang.IllegalArgumentException: Cannot format given Object as a Number
> at java.text.DecimalFormat.format:-1
> at org.apache.poi.ss.usermodel.ExcelGeneralNumberFormat.format:67
> at java.text.Format.format:-1
> at org.apache.poi.ss.usermodel.DataFormatter.performDateFormatting:736
> at org.apache.poi.ss.usermodel.DataFormatter.formatRawCellContents:804
> at org.apache.poi.ss.usermodel.DataFormatter.formatRawCellContents:785
> at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.formatNumberDateCell:143
> at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener$TikaFormatTrackingHSSFListener.formatNumberDateCell:633
> at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.internalProcessRecord:405
> at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processRecord:336
> at org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.processRecord:92
> at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord:109
> at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEvents:179
> at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents:136
> at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile:312
> at org.apache.tika.parser.microsoft.ExcelExtractor.parse:169
> at org.apache.tika.parser.microsoft.OfficeParser.parse:177
> at org.apache.tika.parser.microsoft.OfficeParser.parse:130
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)