You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2019/11/12 16:28:00 UTC
[jira] [Commented] (TIKA-2981) Issue with parsing .numbers file
[ https://issues.apache.org/jira/browse/TIKA-2981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16972586#comment-16972586 ]
Tim Allison commented on TIKA-2981:
-----------------------------------
Right. Unfortunately, we still don't have a parser for iworks files > 2013... :(
We might be able to improve detection by a few more heuristics...short of a full parse.
I'll take a look.
> Issue with parsing .numbers file
> --------------------------------
>
> Key: TIKA-2981
> URL: https://issues.apache.org/jira/browse/TIKA-2981
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.22
> Reporter: Szymon
> Priority: Major
> Attachments: example.numbers
>
>
> Hi,
> I have a very similar problem to https://issues.apache.org/jira/browse/TIKA-1966.
> I'm using `new Tika().parseToString(file)` method, but also tried `IWorkPackageParser` directly.
> I'm using `version 6.1 (6369)` of Numbers.
> I'm using `version 1.22` of Tika.
> Please see attached file.
>
> Output: ```
> Index/Document.iwa
> Index/ViewState.iwa
> Index/CalculationEngine.iwa
> Index/Tables/Tile.iwa
> Index/Tables/DataList.iwa
> Index/Tables/DataList-3670-2.iwa
> Index/Tables/DataList-3671-2.iwa
> Index/Tables/DataList-3672-2.iwa
> ...
> ```
>
> Thank you,
> Szymon
--
This message was sent by Atlassian Jira
(v8.3.4#803005)