You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/05/12 17:46:42 UTC

[jira] Commented: (TIKA-418) RuntimeException while getting content for ppsx, ppsm, pptm, thmx and xps file types

    [ https://issues.apache.org/jira/browse/TIKA-418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12866599#action_12866599 ] 

Jukka Zitting commented on TIKA-418:
------------------------------------

It would be helpful if you could attach either the exception stack trace you're getting or an example document that causes this problem.

It looks like support for these file types is not yet included in the Apache POI library that we use for parsing Microsoft Office files.

> RuntimeException while getting content for ppsx, ppsm, pptm, thmx and xps file types
> ------------------------------------------------------------------------------------
>
>                 Key: TIKA-418
>                 URL: https://issues.apache.org/jira/browse/TIKA-418
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.7
>         Environment: Windows
>            Reporter: Rajiv Kumar
>
> I am getting the following error
> Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@269b15
> for the following file types
> .PPSM
> .PPSX
> .PPTM
> .THMX
> .XPS
> .XLSB

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.