You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Trejkaz (JIRA)" <ji...@apache.org> on 2014/11/19 06:53:34 UTC

[jira] [Commented] (TIKA-1358) Add support for newer iWork file formats

    [ https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217468#comment-14217468 ] 

Trejkaz commented on TIKA-1358:
-------------------------------

And of course now iWork is using zip files [again? There is some confusion over whether they were ever using zip files, because the Apple mail client transparently zips package-based documents for transmission over email, so maybe iWork never used zip files until now].

I think the new format is more or less the same as the previous format, just with an outer zip shell on it.


> Add support for newer iWork file formats
> ----------------------------------------
>
>                 Key: TIKA-1358
>                 URL: https://issues.apache.org/jira/browse/TIKA-1358
>             Project: Tika
>          Issue Type: Wish
>          Components: parser
>    Affects Versions: 1.5
>            Reporter: Jelle Kastelein
>              Labels: newbie
>         Attachments: iwork13-testdocs-zips.zip
>
>
> IWork 2013 uses a revised file format which replaces the xml files that hold the content by .iwa files (a binary format). This file format is becoming increasingly relevant as more and more people are using apple products. However, it does not appear to work with the current IWorkPackageParser (tested with several of the example .pages files one can get from the iCloud). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)