Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/04/28 01:56:50 UTC

[jira] [Commented] (TIKA-907) Comments embedded in Pages documents not supported

    [ https://issues.apache.org/jira/browse/TIKA-907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13264126#comment-13264126 ] 

Nick Burch commented on TIKA-907:
---------------------------------

Support added in r1331640. We now collect the annotations (id -> text) as they occur earlier in the file. Then, while handling the main text, we output the annotation's text each time we reach an annotation reference. The annotation currently comes out before the text it annotates, due to the order of the elements, but that could be fixed in future if needed (once we have a better document model).
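
For illustration, the collect-then-emit approach described above could be sketched roughly as follows. This is not the code from r1331640: the element and attribute names (annotation, annotation-ref, body, id, ref), the class name AnnotationSketch and the bracketed output format are all hypothetical, and the real Pages XML schema differs. It only assumes the JDK's built-in SAX parser.

```java
import java.io.StringReader;
import java.util.HashMap;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

public class AnnotationSketch {

    /** Collects annotation text by id, then emits it at each reference in the body. */
    static class Handler extends DefaultHandler {
        // Hypothetical element names; the real Pages format uses different ones.
        private final Map<String, String> annotations = new HashMap<>();
        private final StringBuilder out = new StringBuilder();
        private StringBuilder current; // non-null while inside an <annotation>
        private String currentId;
        private boolean inBody;

        @Override
        public void startElement(String uri, String local, String qName, Attributes atts) {
            if ("annotation".equals(qName)) {
                // Start buffering the text of an annotation definition.
                currentId = atts.getValue("id");
                current = new StringBuilder();
            } else if ("body".equals(qName)) {
                inBody = true;
            } else if ("annotation-ref".equals(qName) && inBody) {
                // Emit the stored text at the reference, i.e. before the annotated text.
                String text = annotations.get(atts.getValue("ref"));
                if (text != null) {
                    out.append('[').append(text).append("] ");
                }
            }
        }

        @Override
        public void characters(char[] ch, int start, int length) {
            if (current != null) {
                current.append(ch, start, length);
            } else if (inBody) {
                out.append(ch, start, length);
            }
        }

        @Override
        public void endElement(String uri, String local, String qName) {
            if ("annotation".equals(qName)) {
                annotations.put(currentId, current.toString());
                current = null;
            } else if ("body".equals(qName)) {
                inBody = false;
            }
        }
    }

    /** Parses the XML and returns the body text with annotation text inlined. */
    static String extract(String xml) throws Exception {
        Handler handler = new Handler();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), handler);
        return handler.out.toString();
    }
}
```

Because the map is filled in a single streaming pass, this only works when the annotation definitions precede the references that use them, which matches the ordering noted above. A reference with an unknown id is silently skipped in this sketch.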
                
> Comments embedded in Pages documents not supported
> --------------------------------------------------
>
>                 Key: TIKA-907
>                 URL: https://issues.apache.org/jira/browse/TIKA-907
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>         Environment: Windows 7
>            Reporter: Gabriel Valencia
>              Labels: comment, iWork
>             Fix For: 1.2
>
>         Attachments: testPagesCommentsJIRA.pages, testPagesShareiWorkJIRA.pages
>
>
> Comments added to a Pages document are not extracted. This also applies to documents annotated on iWork.com.
