Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2022/03/10 19:58:00 UTC
[jira] [Resolved] (TIKA-3698) Duplicate subject/description for Outlook msgs
[ https://issues.apache.org/jira/browse/TIKA-3698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison resolved TIKA-3698.
-------------------------------
Fix Version/s: 2.3.1
Resolution: Fixed
> Duplicate subject/description for Outlook msgs
> ----------------------------------------------
>
> Key: TIKA-3698
> URL: https://issues.apache.org/jira/browse/TIKA-3698
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Trivial
> Fix For: 2.3.1
>
>
> On TIKA-3629, despite our best efforts to simplify and streamline metadata keys, we backed off and kept (or added back) both keywords _and_ subject.
> Another area where we should probably include both is msg files.
> POI's msg.getSubject() is going to "dc:title", and msg.getConversationTopic() is going to "dc:description". Along the lines of what we did on TIKA-3629, I propose adding msg.getConversationTopic() also under the key "dc:subject".
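A minimal sketch of the mapping described in the issue, for illustration only. It does not use Tika's or POI's real classes: the class MsgMetadataSketch, the helper mapMsgMetadata, and the multi-valued map standing in for a metadata object are all hypothetical, and the two string arguments stand in for the values that msg.getSubject() and msg.getConversationTopic() would return. The actual fix in 2.3.1 may differ.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class MsgMetadataSketch {

    // Hypothetical helper: a plain multi-valued map stands in for a metadata object.
    // subject           ~ the value msg.getSubject() would return
    // conversationTopic ~ the value msg.getConversationTopic() would return
    static Map<String, List<String>> mapMsgMetadata(String subject, String conversationTopic) {
        Map<String, List<String>> metadata = new LinkedHashMap<>();
        // Existing behavior described in the issue.
        add(metadata, "dc:title", subject);
        add(metadata, "dc:description", conversationTopic);
        // Proposed addition: expose the conversation topic under dc:subject as well.
        add(metadata, "dc:subject", conversationTopic);
        return metadata;
    }

    // Skips null values so that a missing field does not create an empty key.
    private static void add(Map<String, List<String>> metadata, String key, String value) {
        if (value == null) {
            return;
        }
        metadata.computeIfAbsent(key, k -> new ArrayList<>()).add(value);
    }

    public static void main(String[] args) {
        System.out.println(mapMsgMetadata("RE: Quarterly report", "Quarterly report"));
    }
}
```

With this shape, consumers that look up either "dc:description" or "dc:subject" find the conversation topic, which is the duplication the issue title refers to.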
--
This message was sent by Atlassian Jira
(v8.20.1#820001)