Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2019/11/12 16:28:00 UTC

[jira] [Commented] (TIKA-2981) Issue with parsing .numbers file

    [ https://issues.apache.org/jira/browse/TIKA-2981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16972586#comment-16972586 ] 

Tim Allison commented on TIKA-2981:
-----------------------------------

Right.  Unfortunately, we still don't have a parser for iWork files newer than 2013... :(

We might be able to improve detection with a few more heuristics, short of a full parse.

I'll take a look.
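
As a rough illustration of the kind of heuristic meant here (a sketch only, not what Tika does today): the entry names in the reported output all sit under Index/ and end in .iwa, so a detector could scan the zip entry names without parsing any entry content. The class and method names below are hypothetical, and the code uses only java.util.zip from the JDK.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper, not part of Tika: guesses whether a zip stream is a
// newer (.iwa-based) iWork package by looking at entry names only.
class IwaHeuristic {

    static boolean looksLikeIwaPackage(InputStream in) throws IOException {
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                String name = entry.getName();
                // Assumption: entries such as Index/Document.iwa, as seen in
                // the reported output, mark the newer format.
                if (name.startsWith("Index/") && name.endsWith(".iwa")) {
                    return true;
                }
            }
        }
        return false;
    }
}
```

This only says "newer iWork package"; telling Numbers apart from Pages or Keynote would need a further signal that is not visible in this report.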

> Issue with parsing .numbers file
> --------------------------------
>
>                 Key: TIKA-2981
>                 URL: https://issues.apache.org/jira/browse/TIKA-2981
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.22
>            Reporter: Szymon
>            Priority: Major
>         Attachments: example.numbers
>
>
> Hi,
> I have a very similar problem to https://issues.apache.org/jira/browse/TIKA-1966.
> I'm using `new Tika().parseToString(file)` method, but also tried `IWorkPackageParser` directly.
> I'm using `version 6.1 (6369)` of Numbers.
> I'm using `version 1.22` of Tika.
> Please see attached file.
>  
> Output:
> ```
> Index/Document.iwa
> Index/ViewState.iwa
> Index/CalculationEngine.iwa
> Index/Tables/Tile.iwa
> Index/Tables/DataList.iwa
> Index/Tables/DataList-3670-2.iwa
> Index/Tables/DataList-3671-2.iwa
> Index/Tables/DataList-3672-2.iwa
>  ...
> ```
>  
> Thank you,
> Szymon



--
This message was sent by Atlassian Jira
(v8.3.4#803005)