You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2023/09/25 15:06:00 UTC

[jira] [Commented] (TIKA-4140) For Outlook emails with a signature, the attachments are not processed.

    [ https://issues.apache.org/jira/browse/TIKA-4140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17768738#comment-17768738 ] 

Tim Allison commented on TIKA-4140:
-----------------------------------

Ref: https://bz.apache.org/bugzilla/show_bug.cgi?id=67083

> For Outlook emails with a signature, the attachments are not processed.
> -----------------------------------------------------------------------
>
>                 Key: TIKA-4140
>                 URL: https://issues.apache.org/jira/browse/TIKA-4140
>             Project: Tika
>          Issue Type: Bug
>          Components: handler
>    Affects Versions: 2.9.0
>         Environment: Java 17
>            Reporter: Rainer Schnitker
>            Priority: Major
>         Attachments: Outlook-Mail-Signature.zip
>
>
> For Outlook emails with a signature, the attachments are not processed. It is not entirely clear whether the class "org.apache.tika.parser.microsoft.OutlookExtractor" has a problem or the POI component used.
> The issure attachement zip file has the same example with or without signature:
>  * File "HTML-Mail without Signature.msg"
>  * File "HTML-Mail Signature Elster.msg"
> case a) the attachements (word and pdf) are processed
> case b) the attachements are not processed  (only one blob, base64 encoded)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)