You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Ken Krugler (JIRA)" <ji...@apache.org> on 2013/12/27 17:17:50 UTC

[jira] [Commented] (TIKA-1206) rfc822 standard headers

    [ https://issues.apache.org/jira/browse/TIKA-1206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13857543#comment-13857543 ] 

Ken Krugler commented on TIKA-1206:
-----------------------------------

Hi Marco - do you have any test documents you could attach (and even better, unit tests) to validate the above changes? As an example, I don't think I've got any emails on-hand that use the Mozilla custom headers. Thanks!

> rfc822 standard headers
> -----------------------
>
>                 Key: TIKA-1206
>                 URL: https://issues.apache.org/jira/browse/TIKA-1206
>             Project: Tika
>          Issue Type: Improvement
>          Components: detector, mime
>    Affects Versions: 1.4
>            Reporter: Marco Quaranta
>            Priority: Minor
>
> Please add the following headers to message/rfc822 type:
> {noformat}
> <match value="Subject:" type="stringignorecase" offset="0" />	
> <match value="Content-Type:" type="stringignorecase" offset="0">
>   <match value="MIME-Version:" type="stringignorecase" offset="0:500"/>
> </match>
> {noformat}
> The former is defined in [RFC822|http://www.ietf.org/rfc/rfc0822.txt], the latter is defined in [RFC2045|http://tools.ietf.org/html/rfc2045#page-10]. I've added a further check on the Content-type header just to be sure it's a MIME.
> In RFC822 (part 4.1) are also defined other headers like: Resent-From, References, Keyword etc.. 
> Please also consider to add the following Mozilla custom-headers:
> {noformat}
> <match value="X-Account-Key:" type="string" offset="0">
>   <match value="X-Mozilla-Keys:" type="string" offset="0:500" />
> </match>
> <match value="X-Mozilla-Status:" type="string" offset="0">
>   <match value="X-Mozilla-Keys:" type="string" offset="0:500" />
> </match>
> {noformat}
> Thanks



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)