Posted to dev@tika.apache.org by "Tyler Palsulich (JIRA)" <ji...@apache.org> on 2015/03/15 23:38:38 UTC
[jira] [Updated] (TIKA-1206) rfc822 standard headers
[ https://issues.apache.org/jira/browse/TIKA-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tyler Palsulich updated TIKA-1206:
----------------------------------
Labels: magic new-parser (was: )
> rfc822 standard headers
> -----------------------
>
> Key: TIKA-1206
> URL: https://issues.apache.org/jira/browse/TIKA-1206
> Project: Tika
> Issue Type: Improvement
> Components: detector, mime
> Affects Versions: 1.4
> Reporter: Marco Quaranta
> Priority: Minor
> Labels: magic, new-parser
> Attachments: example1.eml, example2.eml, example3.eml
>
>
> Please add the following header matches to the message/rfc822 type:
> {noformat}
> <match value="Subject:" type="stringignorecase" offset="0" />
> <match value="Content-Type:" type="stringignorecase" offset="0">
>   <match value="MIME-Version:" type="stringignorecase" offset="0:500" />
> </match>
> {noformat}
> The former is defined in [RFC822|http://www.ietf.org/rfc/rfc0822.txt]; the latter is defined in [RFC2045|http://tools.ietf.org/html/rfc2045#page-10]. For the Content-Type header I've added a further check (a nested MIME-Version: match at offset 0:500) just to be sure it's a MIME message.
> RFC822 (section 4.1) also defines other headers, such as Resent-From, References, Keywords, etc.
> Please also consider adding the following Mozilla custom headers:
> {noformat}
> <match value="X-Account-Key:" type="string" offset="0">
>   <match value="X-Mozilla-Keys:" type="string" offset="0:500" />
> </match>
> <match value="X-Mozilla-Status:" type="string" offset="0">
>   <match value="X-Mozilla-Keys:" type="string" offset="0:500" />
> </match>
> {noformat}
> Thanks
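
For readers unfamiliar with the magic syntax, here is a rough Python sketch of what the proposed matches would do. It assumes that sibling matches are alternatives, that a nested match must succeed together with its parent, and that offset="0:500" means the value may start at any offset from 0 to 500. The names match_at and looks_like_rfc822 are made up for illustration and are not part of Tika.

```python
from typing import Optional


def match_at(data: bytes, value: str, start: int = 0,
             end: Optional[int] = None, ignore_case: bool = False) -> bool:
    """Return True if `value` begins at some offset in [start, end] of `data`."""
    if end is None:
        end = start  # a single offset, e.g. offset="0"
    needle = value.encode("ascii")
    if ignore_case:  # assumed behaviour of type="stringignorecase"
        data, needle = data.lower(), needle.lower()
    for off in range(start, end + 1):
        if data[off:off + len(needle)] == needle:
            return True
    return False


def looks_like_rfc822(data: bytes) -> bool:
    """Apply the matches proposed in this issue (hypothetical helper)."""
    # Top-level match: Subject: at offset 0, case-insensitive.
    if match_at(data, "Subject:", ignore_case=True):
        return True
    # Nested match: Content-Type: at offset 0 AND MIME-Version: within 0:500.
    if (match_at(data, "Content-Type:", ignore_case=True)
            and match_at(data, "MIME-Version:", 0, 500, ignore_case=True)):
        return True
    # Mozilla headers: case-sensitive parent at offset 0 AND X-Mozilla-Keys:.
    for parent in ("X-Account-Key:", "X-Mozilla-Status:"):
        if match_at(data, parent) and match_at(data, "X-Mozilla-Keys:", 0, 500):
            return True
    return False
```

Under these assumptions, a file starting with "Content-Type:" but lacking "MIME-Version:" in the first 500 bytes would not be detected, which is the extra safety check described in the issue.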
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)