You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@tika.apache.org by "Damiano (JIRA)" <ji...@apache.org> on 2015/05/20 16:00:37 UTC

[jira] [Updated] (TIKA-1633) Can't extract .png images from pdf document

     [ https://issues.apache.org/jira/browse/TIKA-1633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Damiano updated TIKA-1633:
--------------------------
    Description: 
Hello,
I am running tika doing:

*java -jar tika-server-1.8.jar*

then I need to extract images from document, i use:

*curl -X PUT -H "Accept: application/zip" -T /home/damiano/html_images.pdf http://localhost:9998/unpack/all > content.zip*

In *content.zip* I only see:

__METADATA__
__TEXT__

nothing else!

  was:
Hello,
I am running tika doing:

`
java -jar tika-server-1.8.jar
`

then I need to extract images from document, i use:

`
curl -X PUT -H "Accept: application/zip" -T /home/damiano/html_images.pdf http://localhost:9998/unpack/all > content.zip
`
In **content.zip** I only see:

`
__METADATA__
__TEXT__
`

nothing else!


> Can't extract .png images from pdf document
> -------------------------------------------
>
>                 Key: TIKA-1633
>                 URL: https://issues.apache.org/jira/browse/TIKA-1633
>             Project: Tika
>          Issue Type: Bug
>          Components: server
>    Affects Versions: 1.8
>            Reporter: Damiano
>
> Hello,
> I am running tika doing:
> *java -jar tika-server-1.8.jar*
> then I need to extract images from document, i use:
> *curl -X PUT -H "Accept: application/zip" -T /home/damiano/html_images.pdf http://localhost:9998/unpack/all > content.zip*
> In *content.zip* I only see:
> __METADATA__
> __TEXT__
> nothing else!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)