Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/09/12 13:11:33 UTC
[jira] [Commented] (TIKA-1414) How to extract embedded images from PDFs?
[ https://issues.apache.org/jira/browse/TIKA-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14131386#comment-14131386 ]
Tim Allison commented on TIKA-1414:
-----------------------------------
From TIKA-1396:
bq. As a hack, you can also change the value of extractInlineImages in org.apache.tika.parser.pdf.PDFParser.properties within the app jar.
Did you try this?
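
For reference, here is a rough sketch of what that hack could look like if scripted rather than done by hand. It copies the jar and rewrites one key in the properties entry named above. The value "true", the function names, and the jar file names are assumptions for illustration only, and the parsing covers just the simple one-line "key value", "key=value" and "key:value" forms.

```python
import re
import zipfile

# Resource path and key taken from the TIKA-1396 quote; the value "true" is an assumption.
PROPS_ENTRY = "org/apache/tika/parser/pdf/PDFParser.properties"


def set_property(text, key, value):
    """Return properties text with `key` set to `value` (appended if absent)."""
    out, found = [], False
    for line in text.splitlines():
        stripped = line.strip()
        # The key is everything before the first '=', ':' or whitespace character.
        name = re.split(r"[=:\s]", stripped, maxsplit=1)[0]
        if stripped and not stripped.startswith(("#", "!")) and name == key:
            out.append(f"{key}={value}")
            found = True
        else:
            out.append(line)
    if not found:
        out.append(f"{key}={value}")
    return "\n".join(out) + "\n"


def patch_jar(src_jar, dst_jar, entry=PROPS_ENTRY,
              key="extractInlineImages", value="true"):
    """Copy src_jar to dst_jar, rewriting one key in one properties entry."""
    with zipfile.ZipFile(src_jar) as src, \
            zipfile.ZipFile(dst_jar, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            data = src.read(info.filename)
            if info.filename == entry:
                patched = set_property(data.decode("iso-8859-1"), key, value)
                data = patched.encode("iso-8859-1")
            dst.writestr(info, data)


if __name__ == "__main__":
    # Hypothetical file names; substitute the actual app jar.
    patch_jar("tika-app.jar", "tika-app-patched.jar")
```

The original jar is left untouched, so the patched copy can be tried with --extract and simply deleted if it does not help.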
> How to extract embedded images from PDFs?
> -----------------------------------------
>
> Key: TIKA-1414
> URL: https://issues.apache.org/jira/browse/TIKA-1414
> Project: Tika
> Issue Type: Bug
> Components: cli
> Affects Versions: 1.6
> Environment: *ubuntu 14.04*
> 3.13.0-35-generic
> 64 bit
> *java version "1.6.0_32"*
> OpenJDK Runtime Environment (IcedTea6 1.13.4) (6b32-1.13.4-4ubuntu0.12.04.2)
> OpenJDK Server VM (build 23.25-b01, mixed mode)
> Reporter: Damiano
> Labels: features
>
> Hello,
> as I reported in TIKA-1396, I am trying to extract embedded images from PDF files. This has not been resolved in Tika 1.6.
> I am not able to extract images from *CLI* using *--extract* parameter.
> How can I extract those images?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)