Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/11/12 06:12:34 UTC

[jira] [Commented] (TIKA-1473) Apache Tika is not working for .docx documents

    [ https://issues.apache.org/jira/browse/TIKA-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207694#comment-14207694 ] 

Nick Burch commented on TIKA-1473:
----------------------------------

Could you attach the file that triggers the problem, along with the code that triggers it (ideally as a JUnit test)?

We have lots of unit tests for .docx files, so we'll need to investigate what's going wrong in your situation.

> Apache Tika is not working for .docx documents 
> -----------------------------------------------
>
>                 Key: TIKA-1473
>                 URL: https://issues.apache.org/jira/browse/TIKA-1473
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.5, 1.6
>            Reporter: Franco Catto
>            Priority: Blocker
>
> I am using Apache Tika 1.6 to read different document files.
> It reads PDF and old-format .doc files, but when I try to read a .docx file, it throws the following exception:
> org.apache.tika.exception.TikaException: Failed to close temporary resources
>     at org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
>     at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)
>     ...
> The resource cannot be closed because it is still in use by the Java process, certainly by the OOXML parser.
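
The failure mode described in the report (a temporary resource that cannot be released because something still holds it open) can be illustrated with a minimal sketch in plain Java. This is not Tika's implementation and does not use any Tika API; the class name `TempResourceSketch` and the method `spoolAndRelease` are hypothetical, chosen only for illustration. The assumption is that on some platforms, notably Windows, deleting a file can fail while a handle on it is still open, so the reader is closed before the delete is attempted.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical illustration only; not part of Apache Tika.
class TempResourceSketch {

    /**
     * Spools the given bytes to a temporary file, reads them back,
     * and deletes the file. Returns the number of bytes read back.
     */
    static long spoolAndRelease(byte[] data) {
        try {
            Path tmp = Files.createTempFile("sketch-", ".docx");
            try {
                Files.write(tmp, data);
                long count = 0;
                // try-with-resources closes the stream before the finally
                // block below runs, so no handle is open at delete time.
                try (InputStream in = Files.newInputStream(tmp)) {
                    byte[] buf = new byte[8192];
                    int n;
                    while ((n = in.read(buf)) != -1) {
                        count += n;
                    }
                }
                return count;
            } finally {
                // Throws if the file cannot be removed, for example because
                // another stream on it was left open on a platform that
                // forbids deleting open files.
                Files.delete(tmp);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(spoolAndRelease(new byte[] {1, 2, 3, 4, 5}));
    }
}
```

If the inner stream were left open, the `Files.delete` call in the `finally` block is where a "still in use" failure would be expected to surface on such platforms.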


