Posted to dev@tika.apache.org by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2014/02/22 00:15:19 UTC

[jira] [Created] (TIKA-1244) Better parsing of Mbox files

Luis Filipe Nassif created TIKA-1244:
----------------------------------------

             Summary: Better parsing of Mbox files
                 Key: TIKA-1244
                 URL: https://issues.apache.org/jira/browse/TIKA-1244
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.5
            Reporter: Luis Filipe Nassif


MboxParser currently loses the metadata of every email except the first. It does not extract or parse the individual emails, nor does it decode their parts. It should handle embedded emails the way other container parsers do, so that each email is parsed automatically by RFC822Parser. I will try to add a patch for this.
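
To illustrate the container approach described above, here is a minimal, stdlib-only sketch of the first step: splitting an mbox stream into individual raw messages, each of which could then be handed to an embedded parser such as RFC822Parser. This is not Tika's implementation. The class name MboxSplitSketch and the method splitMessages are hypothetical, and the separator rule (any line starting with "From ") is a simplification that ignores mbox variants and ">From " quoting in message bodies.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of Tika.
class MboxSplitSketch {

    // Splits mbox text into raw RFC 822 messages.
    // Assumption: every line starting with "From " begins a new message.
    public static List<String> splitMessages(String mbox) throws IOException {
        List<String> messages = new ArrayList<>();
        StringBuilder current = null;
        BufferedReader reader = new BufferedReader(new StringReader(mbox));
        String line;
        while ((line = reader.readLine()) != null) {
            if (line.startsWith("From ")) {
                // Close the previous message, if any, and start a new one.
                if (current != null) {
                    messages.add(current.toString());
                }
                current = new StringBuilder();
                continue; // the separator line is not part of the message itself
            }
            if (current != null) {
                current.append(line).append("\r\n");
            }
        }
        if (current != null) {
            messages.add(current.toString());
        }
        return messages;
    }

    public static void main(String[] args) throws IOException {
        String mbox = "From a@example.org Sat Feb 22 00:15:19 2014\n"
                + "Subject: first\n\nbody one\n"
                + "From b@example.org Sat Feb 22 00:16:00 2014\n"
                + "Subject: second\n\nbody two\n";
        // Print the first header line of each extracted message.
        for (String message : splitMessages(mbox)) {
            System.out.println(message.split("\r\n")[0]);
        }
    }
}
```

Because each message is returned separately, a container-style parser could give every email its own metadata instead of keeping only the first one's.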



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)