Posted to dev@tika.apache.org by "Matthew Caruana Galizia (JIRA)" <ji...@apache.org> on 2017/08/31 16:04:00 UTC

[jira] [Created] (TIKA-2455) Flag in metadata for alternative email bodies

Matthew Caruana Galizia created TIKA-2455:
---------------------------------------------

             Summary: Flag in metadata for alternative email bodies
                 Key: TIKA-2455
                 URL: https://issues.apache.org/jira/browse/TIKA-2455
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.16
            Reporter: Matthew Caruana Galizia
            Priority: Minor


When a multipart RFC822 email is parsed, there is no way to distinguish alternative versions of the body (for example, the text/plain and text/html parts of a multipart/alternative message) from attachments.

It would be ideal if a flag were set in the metadata passed to the {{EmbeddedDocumentExtractor}} to indicate that the stream is an alternative version of the body.

GUIs that present the data extracted from an email could then distinguish alternative bodies from attachments and present them separately.
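
To illustrate the idea, here is a minimal sketch of how a consumer might route embedded streams once such a flag exists. It deliberately does not use the Tika API: a plain Map stands in for the metadata object handed to the extractor, and the key name ALTERNATIVE_BODY_KEY, the class AlternativeBodyRouter and the method handleEmbedded are all hypothetical names chosen for this example, since this issue does not propose a concrete key.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Sketch only: models the requested behaviour without depending on Tika.
class AlternativeBodyRouter {
    // Hypothetical metadata key; no real Tika property is implied.
    static final String ALTERNATIVE_BODY_KEY = "hypothetical:alternativeBody";

    final List<String> alternativeBodies = new ArrayList<>();
    final List<String> attachments = new ArrayList<>();

    // Stand-in for the per-stream callback of an embedded document extractor.
    // The Map plays the role of the metadata passed along with each stream.
    void handleEmbedded(String name, Map<String, String> metadata) {
        if ("true".equals(metadata.get(ALTERNATIVE_BODY_KEY))) {
            alternativeBodies.add(name);   // another rendering of the message body
        } else {
            attachments.add(name);         // unflagged streams are treated as attachments
        }
    }

    public static void main(String[] args) {
        AlternativeBodyRouter router = new AlternativeBodyRouter();

        // An HTML rendering of the body, flagged as an alternative.
        Map<String, String> htmlBody = new HashMap<>();
        htmlBody.put(ALTERNATIVE_BODY_KEY, "true");
        router.handleEmbedded("body.html", htmlBody);

        // A regular attachment carries no flag.
        router.handleEmbedded("report.pdf", new HashMap<>());

        System.out.println("Alternative bodies: " + router.alternativeBodies);
        System.out.println("Attachments: " + router.attachments);
    }
}
```

With a flag like this, a GUI could show the alternative bodies as switchable views of the message and list only the unflagged streams as attachments. Treating a missing flag as "attachment" keeps the sketch compatible with streams that carry no such metadata.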



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)