You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "James Baker (JIRA)" <ji...@apache.org> on 2014/10/03 13:59:35 UTC

[jira] [Comment Edited] (TIKA-1427) PDF Images don't appear in structured view

    [ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157916#comment-14157916 ] 

James Baker edited comment on TIKA-1427 at 10/3/14 11:58 AM:
-------------------------------------------------------------

Thanks for your work on this Tim. Image extraction is working and <img> tags are being inserted into the structured view, but it is inserting them at the bottom of each page. Is it not possible to have them inserted at the correct location within the document?


was (Author: james.d.baker):
Image extraction is working and <img> tags are being inserted into the structured view, but it is inserting them at the bottom of each page. Is it not possible to have them inserted at the correct location within the document?

> PDF Images don't appear in structured view
> ------------------------------------------
>
>                 Key: TIKA-1427
>                 URL: https://issues.apache.org/jira/browse/TIKA-1427
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.6
>            Reporter: James Baker
>            Assignee: Tim Allison
>              Labels: pdf
>
> When viewing, say, a Word Document, any images appear in the 'structured view' of the document as <img> tags. The same is not true of PDF documents, and we lose both the fact that there is an image present, and where it is in the document.
> Some discussion of this issue in the comments of TIKA-1396.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)