You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2023/09/25 15:06:00 UTC
[jira] [Commented] (TIKA-4140) For Outlook emails with a signature, the attachments are not processed.
[ https://issues.apache.org/jira/browse/TIKA-4140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17768738#comment-17768738 ]
Tim Allison commented on TIKA-4140:
-----------------------------------
Ref: https://bz.apache.org/bugzilla/show_bug.cgi?id=67083
> For Outlook emails with a signature, the attachments are not processed.
> -----------------------------------------------------------------------
>
> Key: TIKA-4140
> URL: https://issues.apache.org/jira/browse/TIKA-4140
> Project: Tika
> Issue Type: Bug
> Components: handler
> Affects Versions: 2.9.0
> Environment: Java 17
> Reporter: Rainer Schnitker
> Priority: Major
> Attachments: Outlook-Mail-Signature.zip
>
>
> For Outlook emails with a signature, the attachments are not processed. It is not entirely clear whether the class "org.apache.tika.parser.microsoft.OutlookExtractor" has a problem or the POI component used.
> The issure attachement zip file has the same example with or without signature:
> * File "HTML-Mail without Signature.msg"
> * File "HTML-Mail Signature Elster.msg"
> case a) the attachements (word and pdf) are processed
> case b) the attachements are not processed (only one blob, base64 encoded)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)