Posted to dev@tika.apache.org by "Patrick (Jira)" <ji...@apache.org> on 2021/03/24 23:50:00 UTC

[jira] [Created] (TIKA-3338) Excel Binary files fail to parse

Patrick created TIKA-3338:
-----------------------------

             Summary: Excel Binary files fail to parse
                 Key: TIKA-3338
                 URL: https://issues.apache.org/jira/browse/TIKA-3338
             Project: Tika
          Issue Type: Bug
            Reporter: Patrick


$ java -jar tika-server-1.24.jar
Mar 24, 2021 7:20:46 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
WARNING: J2KImageReader not loaded. JPEG2000 files will not be processed.
See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
for optional dependencies.

Mar 24, 2021 7:20:47 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
WARNING: org.xerial's sqlite-jdbc is not loaded.
Please provide the jar on your classpath to parse sqlite files.
See tika-parsers/pom.xml for the correct version.
INFO Starting Apache Tika 1.24 server
INFO Setting the server's publish address to be http://localhost:9998/
INFO Logging initialized @2487ms to org.eclipse.jetty.util.log.Slf4jLog
INFO jetty-9.4.24.v20191120; built: 2019-11-20T21:37:49.771Z; git: 363d5f2df3a8a28de40604320230664b9c793c16; jvm 1.8.0_281-b25
INFO Started ServerConnector@75d2da2d{HTTP/1.1,[http/1.1]}{localhost:9998}
INFO Started @3147ms
WARN Empty contextPath
INFO Started o.e.j.s.h.ContextHandler@1f53a5dc{/,null,AVAILABLE}
INFO Started Apache Tika server at http://localhost:9998/

 

Sending the following request:

curl -s http://localhost:9998/tika --header "Accept: application/json" -T "d:\7bdf7e50f1683b93fba9f30277bde9f4.xlsb" | jq

 

generates the following output in the server terminal window:

 

INFO JVM Runtime does not support Modules
INFO rmeta (autodetecting type)
WARN javax.ws.rs.ClientErrorException: HTTP 406 Not Acceptable
 at org.apache.cxf.jaxrs.utils.SpecExceptions.toHttpException(SpecExceptions.java:117)
 at org.apache.cxf.jaxrs.utils.ExceptionUtils.toHttpException(ExceptionUtils.java:168)
 at org.apache.cxf.jaxrs.utils.JAXRSUtils.findTargetMethod(JAXRSUtils.java:516)
 at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.processRequest(JAXRSInInterceptor.java:181)
 at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.handleMessage(JAXRSInInterceptor.java:78)
 at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:308)
 at org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
 at org.apache.cxf.transport.http.AbstractHTTPDestination.invoke(AbstractHTTPDestination.java:267)
 at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:247)
 at org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:79)
 at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
 at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:235)
 at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1297)
 at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:190)
 at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1212)
 at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
 at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:221)
 at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
 at org.eclipse.jetty.server.Server.handle(Server.java:500)
 at org.eclipse.jetty.server.HttpChannel.lambda$handle$1(HttpChannel.java:383)
 at org.eclipse.jetty.server.HttpChannel.dispatch(HttpChannel.java:547)
 at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:375)
 at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:270)
 at org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:311)
 at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:103)
 at org.eclipse.jetty.io.ChannelEndPoint$2.run(ChannelEndPoint.java:117)
 at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:336)
 at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:313)
 at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:171)
 at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.produce(EatWhatYouKill.java:135)
 at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:806)
 at org.eclipse.jetty.util.thread.QueuedThreadPool$Runner.run(QueuedThreadPool.java:938)
 at java.lang.Thread.run(Unknown Source)
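
Note that the trace above is raised in CXF's JAXRSUtils.findTargetMethod and contains no Tika parser frames, so the 406 may come from content negotiation on the Accept header rather than from the .xlsb parser itself. Below is a minimal Python sketch for checking this. It assumes the server is still listening on http://localhost:9998, and it assumes (not verified against 1.24 here) that /tika can answer with text/plain and /rmeta with JSON. The names build_put and probe are hypothetical helpers, not part of Tika.

```python
import urllib.error
import urllib.request

# Publish address taken from the server log above.
TIKA_URL = "http://localhost:9998"


def build_put(endpoint, accept, payload):
    """Build (but do not send) a PUT upload equivalent to `curl -T`."""
    return urllib.request.Request(
        url=f"{TIKA_URL}/{endpoint.lstrip('/')}",
        data=payload,
        method="PUT",
        headers={"Accept": accept},
    )


def probe(path):
    """Send the same file with several endpoint/Accept pairs and print each status."""
    with open(path, "rb") as handle:
        payload = handle.read()

    # Assumption: these are the combinations worth comparing; the first one
    # is the request from this report.
    attempts = [
        ("tika", "application/json"),
        ("tika", "text/plain"),
        ("rmeta", "application/json"),
    ]
    for endpoint, accept in attempts:
        request = build_put(endpoint, accept, payload)
        try:
            with urllib.request.urlopen(request) as response:
                status = response.status
        except urllib.error.HTTPError as err:
            # Non-2xx answers such as 406 arrive as HTTPError.
            status = err.code
        print(f"/{endpoint} Accept: {accept} -> HTTP {status}")
```

Calling probe(r"d:\7bdf7e50f1683b93fba9f30277bde9f4.xlsb") prints one status per combination. If only the application/json request to /tika returns 406, the file itself is probably not what fails.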



--
This message was sent by Atlassian Jira
(v8.3.4#803005)