Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/04/21 14:41:59 UTC

[jira] [Commented] (TIKA-1295) Make some Dublin Core items multi-valued

    [ https://issues.apache.org/jira/browse/TIKA-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14504884#comment-14504884 ] 

Tim Allison commented on TIKA-1295:
-----------------------------------

[~lewismc], +1 to adding the potential for hierarchical metadata on TIKA-1607.  We should ensure that, during the transition (and maybe forever), users can still get strings fairly easily.

> Make some Dublin Core items multi-valued
> ----------------------------------------
>
>                 Key: TIKA-1295
>                 URL: https://issues.apache.org/jira/browse/TIKA-1295
>             Project: Tika
>          Issue Type: Bug
>          Components: metadata
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Minor
>             Fix For: 1.9
>
>
> According to: http://www.pdfa.org/2011/08/pdfa-metadata-xmp-rdf-dublin-core, dc:title, dc:description and dc:rights should allow multiple values because of language alternatives.  Unless anyone objects in the next few days, I'll switch those to Property.toInternalTextBag() from Property.toInternalText().  I'll also modify PDFParser to extract dc:rights.
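
To illustrate the difference between single-valued text and a text bag described above, here is a minimal sketch in plain Java. It does not use Tika's API: MultiValuedSketch and its set/add/get/getValues methods are hypothetical stand-ins, and the sample titles are made up. The point is only that a bag keeps every language alternative, while a simple string accessor can remain available for callers who want one value.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a metadata store; this is NOT Tika's Metadata class.
final class MultiValuedSketch {
    private final Map<String, List<String>> values = new LinkedHashMap<>();

    // Single-valued semantics: a later value replaces the earlier one.
    public void set(String name, String value) {
        List<String> single = new ArrayList<>();
        single.add(value);
        values.put(name, single);
    }

    // Bag semantics: every value is kept, in insertion order.
    public void add(String name, String value) {
        values.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
    }

    // Convenience accessor: the first value as a plain string, or null if absent.
    public String get(String name) {
        List<String> v = values.get(name);
        return (v == null || v.isEmpty()) ? null : v.get(0);
    }

    // All values for a name, as a defensive copy; empty list if absent.
    public List<String> getValues(String name) {
        return new ArrayList<>(values.getOrDefault(name, Collections.emptyList()));
    }

    public static void main(String[] args) {
        MultiValuedSketch single = new MultiValuedSketch();
        single.set("dc:title", "A Title");   // made-up sample values
        single.set("dc:title", "Un titre");  // overwrites the first title

        MultiValuedSketch bag = new MultiValuedSketch();
        bag.add("dc:title", "A Title");
        bag.add("dc:title", "Un titre");     // kept alongside the first title

        System.out.println(single.getValues("dc:title"));
        System.out.println(bag.getValues("dc:title"));
        System.out.println(bag.get("dc:title"));
    }
}
```

With single-valued semantics the second language alternative silently replaces the first, which is the loss the issue is trying to avoid. Keeping a get-style accessor that returns one string is one possible way to keep strings easy to reach once a property becomes multi-valued.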


