You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Damiano (JIRA)" <ji...@apache.org> on 2014/09/29 10:00:53 UTC

[jira] [Created] (TIKA-1431) How to extract embedded images in a document?

Damiano created TIKA-1431:
-----------------------------

             Summary: How to extract embedded images in a document?
                 Key: TIKA-1431
                 URL: https://issues.apache.org/jira/browse/TIKA-1431
             Project: Tika
          Issue Type: Bug
          Components: server
    Affects Versions: 1.6
         Environment: *ubuntu 14.04 LTS*
{{quote}}
MD A10-5800K APU with Radeon(tm) HD Graphics × 4 
{{quote}}

*java version "1.6.0_32"*
{{quote}}
OpenJDK Runtime Environment (IcedTea6 1.13.4) (6b32-1.13.4-4ubuntu0.12.04.2)
OpenJDK Client VM (build 23.25-b01, mixed mode, sharing)
{{quote}}
            Reporter: Damiano


Hello,

I just downloaded Tika Server from here: https://archive.apache.org/dist/tika/tika-server-1.6.jar

To start the server I did: *java -jar tika-server-1.6.jar  --port=12345*

The server has started without problem, then I tried to do few tests using `curl`.

I have noticed the error using:

{{quote}}
curl -T test.pdf http://localhost:12345/all > x.zip
{{quote}}

{{quote}}
Sep 29, 2014 7:50:48 AM org.apache.cxf.jaxrs.utils.JAXRSUtils findTargetMethod
WARNING: No operation matching request path "/all" is found, Relative Path: /all, HTTP Method: PUT, ContentType: */*, Accept: */*,. Please enable FINE/TRACE log level for more details.
Sep 29, 2014 7:50:48 AM org.apache.cxf.jaxrs.impl.WebApplicationExceptionMapper toResponse
WARNING: javax.ws.rs.ClientErrorException
	at org.apache.cxf.jaxrs.utils.JAXRSUtils.findTargetMethod(JAXRSUtils.java:503)
	at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.processRequest(JAXRSInInterceptor.java:218)
	at org.apache.cxf.jaxrs.interceptor.JAXRSInInterceptor.handleMessage(JAXRSInInterceptor.java:90)
	at org.apache.cxf.phase.PhaseInterceptorChain.doIntercept(PhaseInterceptorChain.java:272)
	at org.apache.cxf.transport.ChainInitiationObserver.onMessage(ChainInitiationObserver.java:121)
	at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.serviceRequest(JettyHTTPDestination.java:355)
	at org.apache.cxf.transport.http_jetty.JettyHTTPDestination.doService(JettyHTTPDestination.java:319)
	at org.apache.cxf.transport.http_jetty.JettyHTTPHandler.handle(JettyHTTPHandler.java:72)
	at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1088)
	at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1024)
	at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
	at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:255)
	at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
	at org.eclipse.jetty.server.Server.handle(Server.java:370)
	at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:494)
	at org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:971)
	at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:1033)
	at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:651)
	at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:235)
	at org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:82)
	at org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:667)
	at org.eclipse.jetty.io.nio.SelectChannelEndPoint$1.run(SelectChannelEndPoint.java:52)
	at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
	at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
	at java.lang.Thread.run(Thread.java:701)
{{quote}}

I get an empty x.zip file. I was trying to use this command to extract embedded images inside my test.pdf.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)