Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2022/01/24 19:41:00 UTC
[jira] [Comment Edited] (TIKA-3661) tika-app ui is no longer processing embedded files
[ https://issues.apache.org/jira/browse/TIKA-3661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17481349#comment-17481349 ]
Tim Allison edited comment on TIKA-3661 at 1/24/22, 7:40 PM:
-------------------------------------------------------------
It looks like at least as far back as 1.26, we also weren't parsing embedded files in the UI. This feels weird.
was (Author: tallison@mitre.org):
It looks like at least as far back as 1.26, we also weren't parsing embedded files in the UI. This feels weird.
> tika-app ui is no longer processing embedded files
> --------------------------------------------------
>
> Key: TIKA-3661
> URL: https://issues.apache.org/jira/browse/TIKA-3661
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Minor
>
> Discovered this during the meetup today. I was using 2.2.1, and embedded content was not extracted in the regular XHTML output. It was correctly extracted by the RecursiveParserWrapper.