You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Damiano (JIRA)" <ji...@apache.org> on 2015/05/20 16:04:59 UTC
[jira] [Commented] (TIKA-1633) Can't extract .png images from pdf
document
[ https://issues.apache.org/jira/browse/TIKA-1633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552350#comment-14552350 ]
Damiano commented on TIKA-1633:
-------------------------------
Please take a look at the pdf.
http://www.filedropper.com/htmlimages
> Can't extract .png images from pdf document
> -------------------------------------------
>
> Key: TIKA-1633
> URL: https://issues.apache.org/jira/browse/TIKA-1633
> Project: Tika
> Issue Type: Bug
> Components: server
> Affects Versions: 1.8
> Reporter: Damiano
>
> Hello,
> I am running tika doing:
> *java -jar tika-server-1.8.jar*
> then I need to extract images from document, i use:
> *curl -X PUT -H "Accept: application/zip" -T /home/damiano/html_images.pdf http://localhost:9998/unpack/all > content.zip*
> In content.zip I only see:
> __METADATA__
> __TEXT__
> nothing else!
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)