You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2011/05/06 07:16:03 UTC

[jira] [Resolved] (TIKA-656) Outlook dates using the wrong metadata key

     [ https://issues.apache.org/jira/browse/TIKA-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Burch resolved TIKA-656.
-----------------------------

       Resolution: Fixed
    Fix Version/s: 1.0

Fixed - the three mail parsers now all output their dates as proper ISO8601 formatted, as Metadata.DATE and Metadata.CREATION_DATE

Also fixed a poifs date extraction as iso8601 issue too

> Outlook dates using the wrong metadata key
> ------------------------------------------
>
>                 Key: TIKA-656
>                 URL: https://issues.apache.org/jira/browse/TIKA-656
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>             Fix For: 1.0
>
>
> Currently, the Outlook extractor fetches the "Accepted By Mail Server" date from POI, and then saves this into Metadata.EDIT_TIME and Metadata.LAST_SAVED, neither of which look right, and neither of which are date properties.
> The rfc822 parser uses Metadata.CREATION_DATE, which is a Date property. The mbox parser uses Metadata.DATE, another (but different) Date property
> All three should probably use the same. I'd suggest that for now, they all output the same value to both CREATION_DATE and DATE

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira