You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Manish (JIRA)" <ji...@apache.org> on 2010/08/21 12:28:16 UTC

[jira] Created: (TIKA-489) Embedded Documents within documents

Embedded Documents within documents
-----------------------------------

                 Key: TIKA-489
                 URL: https://issues.apache.org/jira/browse/TIKA-489
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 0.7
         Environment: All
            Reporter: Manish
            Priority: Trivial
             Fix For: 1.0


If there are embedded documents(objects) without word files, those are not getting parsed. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-489) Embedded Documents within documents

Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902388#action_12902388 ] 

Nick Burch commented on TIKA-489:
---------------------------------

Is the information in http://wiki.apache.org/tika/RecursiveMetadata any help with getting at the information you want?

> Embedded Documents within documents
> -----------------------------------
>
>                 Key: TIKA-489
>                 URL: https://issues.apache.org/jira/browse/TIKA-489
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.7
>         Environment: All
>            Reporter: Manish
>            Priority: Trivial
>             Fix For: 1.0
>
>
> If there are embedded documents(objects) without word files, those are not getting parsed. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (TIKA-489) Embedded Documents within documents

Posted by "Manish (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Manish updated TIKA-489:
------------------------

    Attachment: doc1.doc

I am trying to parse the attached file doc1.doc. 
I has doc2 embedded within it. 
Anyway to get the content of doc2? 

> Embedded Documents within documents
> -----------------------------------
>
>                 Key: TIKA-489
>                 URL: https://issues.apache.org/jira/browse/TIKA-489
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.7
>         Environment: All
>            Reporter: Manish
>            Priority: Trivial
>             Fix For: 1.0
>
>         Attachments: doc1.doc
>
>
> If there are embedded documents(objects) without word files, those are not getting parsed. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.