You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/09/29 13:08:33 UTC

[jira] [Commented] (TIKA-1431) How to extract embedded images in a document?

    [ https://issues.apache.org/jira/browse/TIKA-1431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14151587#comment-14151587 ] 

Nick Burch commented on TIKA-1431:
----------------------------------

If you go to http://localhost:9998/ you'll see the list of all the available endpoints

/all is not a valid endpoint in Tika 1.6, I think you probably want /unpack/all

> How to extract embedded images in a document?
> ---------------------------------------------
>
>                 Key: TIKA-1431
>                 URL: https://issues.apache.org/jira/browse/TIKA-1431
>             Project: Tika
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 1.6
>         Environment: *ubuntu 14.04 LTS*
> {quote}
> MD A10-5800K APU with Radeon(tm) HD Graphics × 4 
> {quote}
> *java version "1.6.0_32"*
> {quote}
> OpenJDK Runtime Environment (IcedTea6 1.13.4) (6b32-1.13.4-4ubuntu0.12.04.2)
> OpenJDK Client VM (build 23.25-b01, mixed mode, sharing)
> {quote}
>            Reporter: Damiano
>              Labels: 1.6, server
>
> Hello,
> I just downloaded Tika Server from here: https://archive.apache.org/dist/tika/tika-server-1.6.jar
> To start the server I did: *java -jar tika-server-1.6.jar  --port=12345*
> The server has started without problem, then I tried to do few tests using `curl`.
> I have noticed the error using: *curl -T test.pdf http://localhost:12345/all > x.zip*
> {quote}
> Sep 29, 2014 7:50:48 AM org.apache.cxf.jaxrs.utils.JAXRSUtils findTargetMethod
> WARNING: No operation matching request path "/all" is found, Relative Path: /all, HTTP Method: PUT, ContentType: */*, Accept: */*,. Please enable FINE/TRACE log level for more details.
> Sep 29, 2014 7:50:48 AM org.apache.cxf.jaxrs.impl.WebApplicationExceptionMapper toResponse
> WARNING: javax.ws.rs.ClientErrorException
> 	at org.apache.cxf.jaxrs.utils.JAXRSUtils.findTargetMethod(JAXRSUtils.java:503)
> 	at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.processRequest(JAXRSInInterceptor.java:218)
> 	at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.handleMessage(JAXRSInInterceptor.java:90)
> 	at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:272)
> 	at org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
> 	at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.serviceRequest(JettyHTTPDestination.java:355)
> 	at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:319)
> 	at org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:72)
> 	at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1088)
> 	at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1024)
> 	at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
> 	at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
> 	at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
> 	at org.eclipse.jetty.server.Server.handle(Server.java:370)
> 	at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:494)
> 	at org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:971)
> 	at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:1033)
> 	at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:651)
> 	at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:235)
> 	at org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:82)
> 	at org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:667)
> 	at org.eclipse.jetty.io.nio.SelectChannelEndPoint$1.run(SelectChannelEndPoint.java:52)
> 	at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
> 	at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
> 	at java.lang.Thread.run(Thread.java:701)
> {quote}
> I get an empty x.zip file. I was trying to use this command to extract embedded images inside my test.pdf.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)