You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Damiano (JIRA)" <ji...@apache.org> on 2015/05/20 16:04:59 UTC

[jira] [Commented] (TIKA-1633) Can't extract .png images from pdf document

    [ https://issues.apache.org/jira/browse/TIKA-1633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552350#comment-14552350 ] 

Damiano commented on TIKA-1633:
-------------------------------

Please take a look at the pdf.

http://www.filedropper.com/htmlimages

> Can't extract .png images from pdf document
> -------------------------------------------
>
>                 Key: TIKA-1633
>                 URL: https://issues.apache.org/jira/browse/TIKA-1633
>             Project: Tika
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 1.8
>            Reporter: Damiano
>
> Hello,
> I am running tika doing:
> *java -jar tika-server-1.8.jar*
> then I need to extract images from document, i use:
> *curl -X PUT -H "Accept: application/zip" -T /home/damiano/html_images.pdf http://localhost:9998/unpack/all > content.zip*
> In content.zip I only see:
> __METADATA__
> __TEXT__
> nothing else!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)