You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2021/03/26 13:17:00 UTC

[jira] [Comment Edited] (TIKA-3338) Excel Binary files fail to parse

    [ https://issues.apache.org/jira/browse/TIKA-3338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17309419#comment-17309419 ] 

Tim Allison edited comment on TIKA-3338 at 3/26/21, 1:16 PM:
-------------------------------------------------------------

The {{/tika}} endpoint doesn't accept json.  Try the {{/rmeta}} endpoint if you want json.

 

See: [https://cwiki.apache.org/confluence/display/TIKA/TikaServer]

 

Please ask on our user list if you have any further questions: user@tika.apache.org


was (Author: tallison@mitre.org):
The {{/tika}} endpoint doesn't accept json.  Try the {{/rmeta}} endpoint if you want json.

> Excel Binary files fail to parse
> --------------------------------
>
>                 Key: TIKA-3338
>                 URL: https://issues.apache.org/jira/browse/TIKA-3338
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Patrick
>            Priority: Major
>
> $ java -jar tika-server-1.24.jar
> Mar 24, 2021 7:20:46 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: J2KImageReader not loaded. JPEG2000 files will not be processed.
> See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
> for optional dependencies.
> Mar 24, 2021 7:20:47 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: org.xerial's sqlite-jdbc is not loaded.
> Please provide the jar on your classpath to parse sqlite files.
> See tika-parsers/pom.xml for the correct version.
> INFO Starting Apache Tika 1.24 server
> INFO Setting the server's publish address to be http://localhost:9998/
> INFO Logging initialized @2487ms to org.eclipse.jetty.util.log.Slf4jLog
> INFO jetty-9.4.24.v20191120; built: 2019-11-20T21:37:49.771Z; git: 363d5f2df3a8a28de40604320230664b9c793c16; jvm 1.8.0_281-b25
> INFO Started ServerConnector@75d2da2d\{HTTP/1.1,[http/1.1]}{localhost:9998}
> INFO Started @3147ms
> WARN Empty contextPath
> INFO Started o.e.j.s.h.ContextHandler@1f53a5dc\{/,null,AVAILABLE}
> INFO Started Apache Tika server at http://localhost:9998/
>  
> Sending
> curl -s http://localhost:9998/tika --header "Accept: application/json" -T "d:\7bdf7e50f1683b93fba9f30277bde9f4.xlsb" | jq
>  
> Generates: the following in the Server terminal window.
>  
> INFO JVM Runtime does not support Modules
> INFO rmeta (autodetecting type)
> WARN javax.ws.rs.ClientErrorException: HTTP 406 Not Acceptable
>  at org.apache.cxf.jaxrs.utils.SpecExceptions.toHttpException(SpecExceptions.java:117)
>  at org.apache.cxf.jaxrs.utils.ExceptionUtils.toHttpException(ExceptionUtils.java:168)
>  at org.apache.cxf.jaxrs.utils.JAXRSUtils.findTargetMethod(JAXRSUtils.java:516)
>  at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.processRequest(JAXRSInInterceptor.java:181)
>  at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.handleMessage(JAXRSInInterceptor.java:78)
>  at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:308)
>  at org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
>  at org.apache.cxf.transport.http.AbstractHTTPDestination.invoke(AbstractHTTPDestination.java:267)
>  at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:247)
>  at org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:79)
>  at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
>  at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:235)
>  at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1297)
>  at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:190)
>  at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1212)
>  at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
>  at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:221)
>  at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
>  at org.eclipse.jetty.server.Server.handle(Server.java:500)
>  at org.eclipse.jetty.server.HttpChannel.lambda$handle$1(HttpChannel.java:383)
>  at org.eclipse.jetty.server.HttpChannel.dispatch(HttpChannel.java:547)
>  at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:375)
>  at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:270)
>  at org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:311)
>  at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:103)
>  at org.eclipse.jetty.io.ChannelEndPoint$2.run(ChannelEndPoint.java:117)
>  at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:336)
>  at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:313)
>  at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:171)
>  at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.produce(EatWhatYouKill.java:135)
>  at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:806)
>  at org.eclipse.jetty.util.thread.QueuedThreadPool$Runner.run(QueuedThreadPool.java:938)
>  at java.lang.Thread.run(Unknown Source)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)