Posted to dev@tika.apache.org by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2018/02/23 14:12:00 UTC

[jira] [Resolved] (TIKA-2390) Extract images embedded in Html

     [ https://issues.apache.org/jira/browse/TIKA-2390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Luis Filipe Nassif resolved TIKA-2390.
--------------------------------------
       Resolution: Duplicate
    Fix Version/s: 2.0.0
                   1.18

> Extract images embedded in Html
> -------------------------------
>
>                 Key: TIKA-2390
>                 URL: https://issues.apache.org/jira/browse/TIKA-2390
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.15
>            Reporter: Luis Filipe Nassif
>            Priority: Minor
>             Fix For: 1.18, 2.0.0
>
>
> We should handle images embedded in HTML as attachments, as we do for other formats. Are there encodings other than base64 in use for embedding images in HTML?
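
For illustration only, here is a minimal sketch of the kind of extraction the issue asks for. It is not Tika's implementation; the function name `extract_embedded_images` and the regular expression are hypothetical. It assumes the images are embedded as `data:` URIs (RFC 2397), which allow either a base64 payload (marked with `;base64`) or a percent-encoded payload, so both cases are handled.

```python
import base64
import re
from urllib.parse import unquote_to_bytes

# Hypothetical pattern for image data URIs, e.g. data:image/png;base64,iVBORw0...
# Optional ";name=value" parameters may appear before the ";base64" marker.
DATA_URI = re.compile(
    r"data:(?P<mime>image/[\w.+-]+)"
    r"(?:;[\w.+-]+=[\w.+-]+)*"
    r"(?P<b64>;base64)?,"
    r"(?P<data>[^\"'\s>)]*)",
    re.IGNORECASE,
)


def extract_embedded_images(html):
    """Return a list of (mime_type, raw_bytes) for each image data URI in html."""
    images = []
    for match in DATA_URI.finditer(html):
        # Undo any percent-encoding first; for plain base64 text this is a no-op.
        payload = unquote_to_bytes(match.group("data"))
        if match.group("b64"):
            # Raises binascii.Error if the base64 payload is malformed.
            raw = base64.b64decode(payload)
        else:
            # Without ";base64" the payload is the percent-decoded bytes themselves.
            raw = payload
        images.append((match.group("mime").lower(), raw))
    return images
```

Each returned pair could then be handed to whatever mechanism treats attachments in the surrounding parser. A regex scan like this ignores HTML structure, so a real parser would more likely inspect `src` attributes directly.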



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)