You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2018/02/23 14:12:00 UTC
[jira] [Resolved] (TIKA-2390) Extract images embedded in Html
[ https://issues.apache.org/jira/browse/TIKA-2390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Luis Filipe Nassif resolved TIKA-2390.
--------------------------------------
Resolution: Duplicate
Fix Version/s: 2.0.0
1.18
> Extract images embedded in Html
> -------------------------------
>
> Key: TIKA-2390
> URL: https://issues.apache.org/jira/browse/TIKA-2390
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.15
> Reporter: Luis Filipe Nassif
> Priority: Minor
> Fix For: 1.18, 2.0.0
>
>
> We should handle images embedded in html like we do for other formats, as attachments. There are encodings other than base64 used out there to embed images in html?
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)