You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Daan de Wit (JIRA)" <ji...@apache.org> on 2009/07/17 15:23:14 UTC

[jira] Updated: (TIKA-262) ParsingReader does not parse metadata for larger MS Office documents

     [ https://issues.apache.org/jira/browse/TIKA-262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daan de Wit updated TIKA-262:
-----------------------------

    Attachment: lipsum.doc

word document to reproduce the issue

> ParsingReader does not parse metadata for larger MS Office documents
> --------------------------------------------------------------------
>
>                 Key: TIKA-262
>                 URL: https://issues.apache.org/jira/browse/TIKA-262
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.3
>            Reporter: Daan de Wit
>         Attachments: lipsum.doc
>
>
> The ParsingReader should cause the metadata to be extracted before anything is read from the reader. This is not done for certain MS Office files, it seems to be related to the size of the document.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.