You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/09/12 13:11:33 UTC

[jira] [Commented] (TIKA-1414) How to extract embedded images from PDFs?

    [ https://issues.apache.org/jira/browse/TIKA-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14131386#comment-14131386 ] 

Tim Allison commented on TIKA-1414:
-----------------------------------

>From TIKA-1396:
bq. As a hack, you can also change the value of extractInlineImages in org.apache.tika.parser.pdf.PDFParser.properties within the app jar. 

Did you try this?

> How to extract embedded images from PDFs?
> -----------------------------------------
>
>                 Key: TIKA-1414
>                 URL: https://issues.apache.org/jira/browse/TIKA-1414
>             Project: Tika
>          Issue Type: Bug
>          Components: cli
>    Affects Versions: 1.6
>         Environment: *ubuntu 14.04*
> 3.13.0-35-generic
> 64 bit
> *java version "1.6.0_32"*
> OpenJDK Runtime Environment (IcedTea6 1.13.4) (6b32-1.13.4-4ubuntu0.12.04.2)
> OpenJDK Server VM (build 23.25-b01, mixed mode)`
>            Reporter: Damiano
>              Labels: features
>
> Hello,
> as i reported in TIKA-1396 I am tring to extract embedded images from PDF files. It has not been resolved in TIka 1.6.
> I am not able to extract images from *CLI* using *--extract* parameter.
> How can I extract those images?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)