You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tyler Palsulich (JIRA)" <ji...@apache.org> on 2015/03/15 23:38:38 UTC

[jira] [Updated] (TIKA-1206) rfc822 standard headers

     [ https://issues.apache.org/jira/browse/TIKA-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tyler Palsulich updated TIKA-1206:
----------------------------------
    Labels: magic new-parser  (was: )

> rfc822 standard headers
> -----------------------
>
>                 Key: TIKA-1206
>                 URL: https://issues.apache.org/jira/browse/TIKA-1206
>             Project: Tika
>          Issue Type: Improvement
>          Components: detector, mime
>    Affects Versions: 1.4
>            Reporter: Marco Quaranta
>            Priority: Minor
>              Labels: magic, new-parser
>         Attachments: example1.eml, example2.eml, example3.eml
>
>
> Please add the following headers to message/rfc822 type:
> {noformat}
> <match value="Subject:" type="stringignorecase" offset="0" />	
> <match value="Content-Type:" type="stringignorecase" offset="0">
>   <match value="MIME-Version:" type="stringignorecase" offset="0:500"/>
> </match>
> {noformat}
> The former is defined in [RFC822|http://www.ietf.org/rfc/rfc0822.txt], the latter is defined in [RFC2045|http://tools.ietf.org/html/rfc2045#page-10]. I've added a further check on the Content-type header just to be sure it's a MIME.
> In RFC822 (part 4.1) are also defined other headers like: Resent-From, References, Keyword etc.. 
> Please also consider to add the following Mozilla custom-headers:
> {noformat}
> <match value="X-Account-Key:" type="string" offset="0">
>   <match value="X-Mozilla-Keys:" type="string" offset="0:500" />
> </match>
> <match value="X-Mozilla-Status:" type="string" offset="0">
>   <match value="X-Mozilla-Keys:" type="string" offset="0:500" />
> </match>
> {noformat}
> Thanks



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)