You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2022/01/24 19:40:00 UTC
[jira] [Commented] (TIKA-3661) tika-app ui is no longer processing embedded files
[ https://issues.apache.org/jira/browse/TIKA-3661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17481349#comment-17481349 ]
Tim Allison commented on TIKA-3661:
-----------------------------------
It looks like at least as far back as 1.26, we also weren't parsing embedded files in the UI. This feels weird.
> tika-app ui is no longer processing embedded files
> --------------------------------------------------
>
> Key: TIKA-3661
> URL: https://issues.apache.org/jira/browse/TIKA-3661
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Minor
>
> Discovered this during the meetup today. I was using 2.2.1, and embedded content was not extracted via the regular xhtml. It was correctly extracted by the RecursiveParserWrapper.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)