You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tyler Palsulich (JIRA)" <ji...@apache.org> on 2014/09/16 01:37:33 UTC

[jira] [Comment Edited] (TIKA-1414) How to extract embedded images from PDFs?

    [ https://issues.apache.org/jira/browse/TIKA-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134694#comment-14134694 ] 

Tyler Palsulich edited comment on TIKA-1414 at 9/15/14 11:37 PM:
-----------------------------------------------------------------

bq. any interest in adding an example for how to extract embedded images from pdfs in oat.example?

Sounds like a perfect example! Do we have a working version yet? I imagine it will be similar to how OCR will work (TIKA-93), since we have to "turn on" extraction there, too. I'll create a new issue to track this example right now.

Edit: See TIKA-1417.


was (Author: tpalsulich):
bq. any interest in adding an example for how to extract embedded images from pdfs in oat.example?

Sounds like a perfect example! Do we have a working version yet? I imagine it will be similar to how OCR will work (TIKA-93), since we have to "turn on" extraction there, too. I'll create a new issue to track this example right now.

> How to extract embedded images from PDFs?
> -----------------------------------------
>
>                 Key: TIKA-1414
>                 URL: https://issues.apache.org/jira/browse/TIKA-1414
>             Project: Tika
>          Issue Type: Bug
>          Components: cli
>    Affects Versions: 1.6
>         Environment: *ubuntu 14.04*
> 3.13.0-35-generic
> 64 bit
> *java version "1.6.0_32"*
> OpenJDK Runtime Environment (IcedTea6 1.13.4) (6b32-1.13.4-4ubuntu0.12.04.2)
> OpenJDK Server VM (build 23.25-b01, mixed mode)`
>            Reporter: Damiano
>              Labels: features
>
> Hello,
> as i reported in TIKA-1396 I am tring to extract embedded images from PDF files. It has not been resolved in TIka 1.6.
> I am not able to extract images from *CLI* using *--extract* parameter.
> How can I extract those images?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)