You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2021/03/26 13:13:00 UTC
[jira] [Commented] (TIKA-3338) Excel Binary files fail to parse
[ https://issues.apache.org/jira/browse/TIKA-3338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17309419#comment-17309419 ]
Tim Allison commented on TIKA-3338:
-----------------------------------
The {{/tika}} endpoint doesn't accept json. Try the {{/rmeta}} endpoint if you want json.
> Excel Binary files fail to parse
> --------------------------------
>
> Key: TIKA-3338
> URL: https://issues.apache.org/jira/browse/TIKA-3338
> Project: Tika
> Issue Type: Bug
> Reporter: Patrick
> Priority: Major
>
> $ java -jar tika-server-1.24.jar
> Mar 24, 2021 7:20:46 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: J2KImageReader not loaded. JPEG2000 files will not be processed.
> See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
> for optional dependencies.
> Mar 24, 2021 7:20:47 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: org.xerial's sqlite-jdbc is not loaded.
> Please provide the jar on your classpath to parse sqlite files.
> See tika-parsers/pom.xml for the correct version.
> INFO Starting Apache Tika 1.24 server
> INFO Setting the server's publish address to be http://localhost:9998/
> INFO Logging initialized @2487ms to org.eclipse.jetty.util.log.Slf4jLog
> INFO jetty-9.4.24.v20191120; built: 2019-11-20T21:37:49.771Z; git: 363d5f2df3a8a28de40604320230664b9c793c16; jvm 1.8.0_281-b25
> INFO Started ServerConnector@75d2da2d\{HTTP/1.1,[http/1.1]}{localhost:9998}
> INFO Started @3147ms
> WARN Empty contextPath
> INFO Started o.e.j.s.h.ContextHandler@1f53a5dc\{/,null,AVAILABLE}
> INFO Started Apache Tika server at http://localhost:9998/
>
> Sending
> curl -s http://localhost:9998/tika --header "Accept: application/json" -T "d:\7bdf7e50f1683b93fba9f30277bde9f4.xlsb" | jq
>
> Generates: the following in the Server terminal window.
>
> INFO JVM Runtime does not support Modules
> INFO rmeta (autodetecting type)
> WARN javax.ws.rs.ClientErrorException: HTTP 406 Not Acceptable
> at org.apache.cxf.jaxrs.utils.SpecExceptions.toHttpException(SpecExceptions.java:117)
> at org.apache.cxf.jaxrs.utils.ExceptionUtils.toHttpException(ExceptionUtils.java:168)
> at org.apache.cxf.jaxrs.utils.JAXRSUtils.findTargetMethod(JAXRSUtils.java:516)
> at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.processRequest(JAXRSInInterceptor.java:181)
> at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.handleMessage(JAXRSInInterceptor.java:78)
> at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:308)
> at org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
> at org.apache.cxf.transport.http.AbstractHTTPDestination.invoke(AbstractHTTPDestination.java:267)
> at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:247)
> at org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:79)
> at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
> at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:235)
> at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1297)
> at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:190)
> at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1212)
> at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
> at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:221)
> at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
> at org.eclipse.jetty.server.Server.handle(Server.java:500)
> at org.eclipse.jetty.server.HttpChannel.lambda$handle$1(HttpChannel.java:383)
> at org.eclipse.jetty.server.HttpChannel.dispatch(HttpChannel.java:547)
> at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:375)
> at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:270)
> at org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:311)
> at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:103)
> at org.eclipse.jetty.io.ChannelEndPoint$2.run(ChannelEndPoint.java:117)
> at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:336)
> at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:313)
> at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:171)
> at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.produce(EatWhatYouKill.java:135)
> at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:806)
> at org.eclipse.jetty.util.thread.QueuedThreadPool$Runner.run(QueuedThreadPool.java:938)
> at java.lang.Thread.run(Unknown Source)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)