You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2021/10/19 10:31:00 UTC

[jira] [Commented] (TIKA-3576) RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser

    [ https://issues.apache.org/jira/browse/TIKA-3576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17430464#comment-17430464 ] 

Tim Allison commented on TIKA-3576:
-----------------------------------

Thank you for opening this issue and attaching a triggering file.  It looks like it was already fixed in 2.x, but I never backported the fix to 1.x.  It is now fixed in 1.x.

> RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser
> ------------------------------------------------------------------------
>
>                 Key: TIKA-3576
>                 URL: https://issues.apache.org/jira/browse/TIKA-3576
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.27
>         Environment: Windows 10 x64
>            Reporter: redmanmale
>            Priority: Major
>         Attachments: 01.pptx
>
>
> I try to parse pptx document and get this error:
> org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@2a9ddabb
>  at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:297)
>  at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:281)
>  at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
> <business logic>
> Caused by: java.lang.NullPointerException
>  at org.apache.tika.utils.RereadableInputStream.saveByte(RereadableInputStream.java:265)
>  at org.apache.tika.utils.RereadableInputStream.read(RereadableInputStream.java:166)
>  at org.apache.tika.utils.RereadableInputStream.rewind(RereadableInputStream.java:180)
>  at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:116)
>  at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:113)
>  at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:281)
>  ... 11 more
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)