Posted to dev@tika.apache.org by "Nick Burch (Jira)" <ji...@apache.org> on 2021/02/02 09:58:00 UTC

[jira] [Commented] (TIKA-3290) Extension reading it as eml instead of txt

    [ https://issues.apache.org/jira/browse/TIKA-3290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17276999#comment-17276999 ] 

Nick Burch commented on TIKA-3290:
----------------------------------

At first glance, this does seem to be a series of emails, so detecting it as an email seems correct.

What problem are you facing with the new detection? Do you disagree with it being emails?
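
For context on why a file with a .txt extension can still be detected as an email: content-based detection generally looks at the leading bytes of the file rather than its name. The sketch below is a simplified, hypothetical heuristic written only for illustration. It is not Tika's actual detection logic, and the class name, header list, and two-header threshold are all assumptions.

```java
import java.util.List;
import java.util.Locale;

// Illustrative only: a simplified content-sniffing heuristic, NOT Tika's real rules.
class MailSniffSketch {

    // Hypothetical list of header names that suggest an RFC 822 style message.
    private static final List<String> MAIL_HEADERS = List.of(
            "from:", "to:", "subject:", "date:",
            "received:", "message-id:", "return-path:");

    // Returns true when the text opens with at least two recognised header lines.
    // The file extension is never consulted. Folded (continuation) header lines
    // are not handled in this sketch.
    static boolean looksLikeEmail(String content) {
        int matches = 0;
        for (String line : content.split("\r?\n")) {
            if (line.isEmpty()) {
                break; // a blank line ends the header block
            }
            String lower = line.toLowerCase(Locale.ROOT);
            boolean isHeader = MAIL_HEADERS.stream().anyMatch(lower::startsWith);
            if (!isHeader) {
                break; // first non-header line: stop scanning
            }
            matches++;
        }
        return matches >= 2; // assumed threshold
    }

    public static void main(String[] args) {
        String mailLike = "From: alice@example.com\nSubject: hello\n\nBody text";
        String plain = "Meeting notes\nFrom: the team\n";
        System.out.println(looksLikeEmail(mailLike)); // true
        System.out.println(looksLikeEmail(plain));    // false
    }
}
```

Under a heuristic like this, a .txt file that begins with mail headers is classified by its content, which would be consistent with the attachment being reported as eml.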

> Extension reading it as eml instead of txt
> ------------------------------------------
>
>                 Key: TIKA-3290
>                 URL: https://issues.apache.org/jira/browse/TIKA-3290
>             Project: Tika
>          Issue Type: Bug
>          Components: core, mime
>    Affects Versions: 1.25
>            Reporter: Vamsi Molli
>            Priority: Major
>              Labels: tika-parsers
>             Fix For: 1.24.1
>
>         Attachments: test_sample_message.txt
>
>
> The attached file, which has a .txt extension, is being detected as eml instead of txt. With version 1.24.1 it was detected as txt, but after upgrading to 1.25 it is detected as eml. As a result, we get a mail corrupted error while parsing.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)