You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Adam Wilmer (JIRA)" <ji...@apache.org> on 2010/09/14 16:37:33 UTC

[jira] Commented: (TIKA-408) Word 6.0/7.0 documents support in office parser

    [ https://issues.apache.org/jira/browse/TIKA-408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12909268#action_12909268 ] 

Adam Wilmer commented on TIKA-408:
----------------------------------

I see POI 3.7-beta2 with this change is released and the tika dependency updated. Is there any update on enabling support for the older word format in tika? currently the following exception is being thrown

Caused by: org.apache.poi.hwpf.OldWordFileFormatException: The document is too old - Word 95 or older. Try HWPFOldDocument instead?

Happy to offer any assistance if i can be of help.

> Word 6.0/7.0 documents support in office parser
> -----------------------------------------------
>
>                 Key: TIKA-408
>                 URL: https://issues.apache.org/jira/browse/TIKA-408
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.7
>            Reporter: Dmitry Kuzmenko
>            Assignee: Nick Burch
>            Priority: Minor
>         Attachments: testWORD6.doc, word6.patch.gz
>
>
> Current office parser doesn't support old Word 6.0/7.0 documents.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.