You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2022/03/10 19:58:00 UTC

[jira] [Resolved] (TIKA-3698) Duplicate subject/description for Outlook msgs

     [ https://issues.apache.org/jira/browse/TIKA-3698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison resolved TIKA-3698.
-------------------------------
    Fix Version/s: 2.3.1
       Resolution: Fixed

> Duplicate subject/description for Outlook msgs
> ----------------------------------------------
>
>                 Key: TIKA-3698
>                 URL: https://issues.apache.org/jira/browse/TIKA-3698
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Trivial
>             Fix For: 2.3.1
>
>
> On TIKA-3629, despite our best efforts to simplify and streamline metadata keys, we backed off and continued to include/added back keywords _and_ subject.
> Another area where we should probably include both includes msg files.
> POI's msg.getSubject() is going to "dc:title", and msg.getConversationTopic() is going to "dc:description".  Along the lines of what we did on TIKA-3629, I propose adding msg.getConversationTopic() also under the key "dc:subject".



--
This message was sent by Atlassian Jira
(v8.20.1#820001)