You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2011/05/06 07:16:03 UTC
[jira] [Resolved] (TIKA-656) Outlook dates using the wrong metadata
key
[ https://issues.apache.org/jira/browse/TIKA-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch resolved TIKA-656.
-----------------------------
Resolution: Fixed
Fix Version/s: 1.0
Fixed - the three mail parsers now all output their dates as proper ISO8601 formatted, as Metadata.DATE and Metadata.CREATION_DATE
Also fixed a poifs date extraction as iso8601 issue too
> Outlook dates using the wrong metadata key
> ------------------------------------------
>
> Key: TIKA-656
> URL: https://issues.apache.org/jira/browse/TIKA-656
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 0.9
> Reporter: Nick Burch
> Assignee: Nick Burch
> Fix For: 1.0
>
>
> Currently, the Outlook extractor fetches the "Accepted By Mail Server" date from POI, and then saves this into Metadata.EDIT_TIME and Metadata.LAST_SAVED, neither of which look right, and neither of which are date properties.
> The rfc822 parser uses Metadata.CREATION_DATE, which is a Date property. The mbox parser uses Metadata.DATE, another (but different) Date property
> All three should probably use the same. I'd suggest that for now, they all output the same value to both CREATION_DATE and DATE
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira