You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2019/06/28 21:39:00 UTC

[jira] [Commented] (TIKA-2898) wrong email send date being set in OutlookPSTParser

    [ https://issues.apache.org/jira/browse/TIKA-2898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16875245#comment-16875245 ] 

Hudson commented on TIKA-2898:
------------------------------

UNSTABLE: Integrated in Jenkins build tika-2.x-windows #427 (See [https://builds.apache.org/job/tika-2.x-windows/427/])
TIKA-2898 -- add client submit time for pst (tallison: rev 06e734b3cfb192b96581338281c0373ef385d3cf)
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/mbox/OutlookPSTParserTest.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/mbox/OutlookPSTParser.java
* (edit) tika-core/src/main/java/org/apache/tika/metadata/Office.java
TIKA-2898 -- fix for master vs branch_1x diff (tallison: rev 9a8ef5e7844df2569494b27f51e28047a5596018)
* (edit) tika-core/src/main/java/org/apache/tika/metadata/Office.java


> wrong email send date being set in OutlookPSTParser 
> ----------------------------------------------------
>
>                 Key: TIKA-2898
>                 URL: https://issues.apache.org/jira/browse/TIKA-2898
>             Project: Tika
>          Issue Type: Bug
>          Components: handler
>    Affects Versions: 1.21
>            Reporter: Paul Woods
>            Assignee: Tim Allison
>            Priority: Major
>         Attachments: albert_meyers_000_1_1.pst
>
>
> When parsing a pst file from the enron data set I get an email created date of May 2009 from the OutlookPSTParser for all emails.  Looking at the underlying PSTMessage coming up from libpst I can see that the createdDate field has indeed got that value - I'm not sure where that date is coming from, but the Enron data set was released in 2002. 
> I can see that the pst message has the correct date value in the clientSubmitTime field.
>  getClientSubmitTime() is not used when setting metadata in the OutlookPSTParser.
> Can the result of this call be added to the metadata?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)