Posted to dev@tika.apache.org by "Ahmed Owian (JIRA)" <ji...@apache.org> on 2015/04/24 14:43:38 UTC

[jira] [Commented] (TIKA-291) Adobe InDesign support

    [ https://issues.apache.org/jira/browse/TIKA-291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14510966#comment-14510966 ] 

Ahmed Owian commented on TIKA-291:
----------------------------------

When we tried scanning the byte stream to find the XMP packets, we encountered custom control characters that are not valid XML. We therefore opted to use ExifTool to parse the XMP, and took it from there. See https://issues.alfresco.com/jira/browse/MM-371 for details.
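
For anyone who wants to try the byte-stream scan anyway, here is a minimal sketch in Java of what that approach might look like. It is not the code we ended up with (we went with ExifTool), and it rests on several assumptions: that the packets are UTF-8 encoded, that they are delimited by the standard `<?xpacket begin=` and `<?xpacket end=` processing instructions, and that dropping characters outside the XML 1.0 character range is acceptable. The class and method names are made up for illustration, and whether the result is parseable for real InDesign files is untested.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Tika: scans raw bytes for XMP packets.
final class XmpPacketScanner {
    private static final byte[] BEGIN = "<?xpacket begin=".getBytes(StandardCharsets.US_ASCII);
    private static final byte[] END = "<?xpacket end=".getBytes(StandardCharsets.US_ASCII);
    private static final byte[] PI_CLOSE = "?>".getBytes(StandardCharsets.US_ASCII);

    // Naive byte-pattern search; returns the first match at or after 'from', or -1.
    static int indexOf(byte[] data, byte[] pattern, int from) {
        outer:
        for (int i = Math.max(from, 0); i <= data.length - pattern.length; i++) {
            for (int j = 0; j < pattern.length; j++) {
                if (data[i + j] != pattern[j]) {
                    continue outer;
                }
            }
            return i;
        }
        return -1;
    }

    // Returns every span from a begin marker through the "?>" closing the end marker.
    // Assumption: packets are UTF-8; UTF-16 packets would need separate handling.
    static List<String> findPackets(byte[] data) {
        List<String> packets = new ArrayList<>();
        int pos = 0;
        while (true) {
            int start = indexOf(data, BEGIN, pos);
            if (start < 0) break;
            int trailer = indexOf(data, END, start + BEGIN.length);
            if (trailer < 0) break;
            int close = indexOf(data, PI_CLOSE, trailer + END.length);
            if (close < 0) break;
            int stop = close + PI_CLOSE.length;
            packets.add(new String(data, start, stop - start, StandardCharsets.UTF_8));
            pos = stop;
        }
        return packets;
    }

    // True if the code point is allowed by the XML 1.0 "Char" production.
    static boolean isValidXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Drops every code point that XML 1.0 does not allow (e.g. most control characters).
    static String stripInvalidXmlChars(String s) {
        StringBuilder out = new StringBuilder(s.length());
        s.codePoints()
                .filter(XmpPacketScanner::isValidXmlChar)
                .forEach(out::appendCodePoint);
        return out.toString();
    }
}
```

Each string returned by `findPackets` would be passed through `stripInvalidXmlChars` before being handed to an XML parser. Stripping silently discards data, which is one reason delegating to a dedicated tool may be the safer choice.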

> Adobe InDesign support
> ----------------------
>
>                 Key: TIKA-291
>                 URL: https://issues.apache.org/jira/browse/TIKA-291
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Jukka Zitting
>            Priority: Minor
>              Labels: new-parser
>         Attachments: simple_test-1.indd
>
>
> It would be great if Tika could extract content from Adobe InDesign documents.


