You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Martijn van Groningen (JIRA)" <ji...@apache.org> on 2010/05/14 15:30:42 UTC

[jira] Issue Comment Edited: (TIKA-402) Support for Keynote and Pages documents

    [ https://issues.apache.org/jira/browse/TIKA-402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12867499#action_12867499 ] 

Martijn van Groningen edited comment on TIKA-402 at 5/14/10 9:30 AM:
---------------------------------------------------------------------

Thanks for adding! Yes I forgot to add the Apache licence, so good you did that. I've updated my patch. The following has changed:
* The test for pages did not have any assertions. So I added them to match with the test Pages document.
* I noticed that for Keynote presentations the table wasn't parsed. I fixed that and adjusted the Keynote test and the testKeynote.key file.

I will work on the Numbers support in the coming days.

      was (Author: martijn):
    Thanks for adding! Yes I forgot to add the Apache licence, so good you did that. I've updated my patch. The following has changed:
* The test for pages did not have any assertions. So I added them to match with the test Pages document.
* I noticed that for Keynote presentations the table wasn't parsed. I fixed that and adjusted the Keynote test and the testKeynote.key file.

I will work on the number support in the coming days.
  
> Support for Keynote and Pages documents
> ---------------------------------------
>
>                 Key: TIKA-402
>                 URL: https://issues.apache.org/jira/browse/TIKA-402
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>         Attachments: iwork.patch, iwork.patch, iwork.patch, iwork.patch, testKeynote.key, testKeynote.key, testPages.pages
>
>
> It would be nice to have support for documents created by Apple's Keynote and Pages applications. Both file formats are described in http://developer.apple.com/mac/library/documentation/AppleApplications/Conceptual/iWork2-0_XML/Chapter01/Introduction.html. I'm not sure if there already are open source parser libraries for these formats or if we'd need to directly process the XML content.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.