You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/11/02 21:09:00 UTC

[jira] [Comment Edited] (TIKA-2488) Outlook PST Parser fails from NullPointerException

    [ https://issues.apache.org/jira/browse/TIKA-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16236602#comment-16236602 ] 

Tim Allison edited comment on TIKA-2488 at 11/2/17 9:08 PM:
------------------------------------------------------------

Thank you for opening this and sharing a triggering file.  I'm not able to reproduce this with tika-app 1.16 or 1.17-SNAPSHOT, and I'm not able to reproduce this with 1.17-SNAPSHOT's server...will try tika-1.16 server shortly.

If you try tika-app-1.16.jar on the file, do you get the same exception?

Also, do you have any content in that pst?


was (Author: tallison@mitre.org):
Thank you for opening this and sharing a triggering file.  I'm not able to reproduce this with tika-app 1.16 or 1.17-SNAPSHOT, and I'm not able to reproduce this with 1.17-SNAPSHOT's server...will try tika-1.16 server shortly.

If you try tika-app-1.16.jar on the file, do you get the same exception?

> Outlook PST Parser fails from NullPointerException
> --------------------------------------------------
>
>                 Key: TIKA-2488
>                 URL: https://issues.apache.org/jira/browse/TIKA-2488
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.16
>         Environment: Ubuntu 64 bit, openjdk version 1.8.0_31
>            Reporter: Kyle Goode
>            Priority: Minor
>              Labels: outlook
>         Attachments: Outlook Data File - test.pst
>
>
> Getting this error when trying to parse PST file.
> commands used:
> java -jar tika-server-1.16.jar 
> curl -T test.pst http://127.0.0.1:9998/tika --header "Accept: text/plain" 
> WARN  tika: Text extraction failed
> org.apache.tika.exception.TikaException: Unable to unpack document stream
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parse(OutlookPSTParser.java:95)
>         at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>         at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:188)
>         at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>         at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
>         at org.apache.tika.server.resource.TikaResource.parse(TikaResource.java:322)
>         at org.apache.tika.server.resource.TikaResource$5.write(TikaResource.java:421)
>         at org.apache.cxf.jaxrs.provider.BinaryDataProvider.writeTo(BinaryDataProvider.java:169)
>         at org.apache.cxf.jaxrs.utils.JAXRSUtils.writeMessageBody(JAXRSUtils.java:1389)
>         at org.apache.cxf.jaxrs.interceptor.JAXRSOutInterceptor.serializeMessage(JAXRSOutInterceptor.java:243)
>         at org.apache.cxf.jaxrs.interceptor.JAXRSOutInterceptor.processResponse(JAXRSOutInterceptor.java:119)
>         at org.apache.cxf.jaxrs.interceptor.JAXRSOutInterceptor.handleMessage(JAXRSOutInterceptor.java:82)
>         at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:307)
>         at org.apache.cxf.interceptor.OutgoingChainInterceptor.handleMessage(OutgoingChainInterceptor.java:83)
>         at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:307)
>         at org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
>         at org.apache.cxf.transport.http.AbstractHTTPDestination.invoke(AbstractHTTPDestination.java:252)
>         at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:261)
>         at org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:76)
>         at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1088)
>         at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1024)
>         at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
>         at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
>         at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
>         at org.eclipse.jetty.server.Server.handle(Server.java:370)
>         at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:494)
>         at org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:973)
>         at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:1035)
>         at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:647)
>         at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:231)
>         at org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:82)
>         at org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:696)
>         at org.eclipse.jetty.io.nio.SelectChannelEndPoint$1.run(SelectChannelEndPoint.java:53)
>         at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
>         at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
>         at java.lang.Thread.run(Thread.java:748)
> Caused by: org.apache.tika.exception.TikaException: Unable to unpack document stream
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parseMailAttachments(OutlookPSTParser.java:241)
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:121)
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:133)
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:133)
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:133)
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parseFolder(OutlookPSTParser.java:133)
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parse(OutlookPSTParser.java:92)
>         ... 35 more
> Caused by: java.lang.NullPointerException
>         at com.pff.PSTAttachment.getFileInputStream(PSTAttachment.java:119)
>         at org.apache.tika.parser.mbox.OutlookPSTParser.parseMailAttachments(OutlookPSTParser.java:232)
>         ... 41 more
> ERROR Problem with writing the data, class org.apache.tika.server.resource.TikaResource$5, ContentType: text/plain



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)