You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Lutz (JIRA)" <ji...@apache.org> on 2011/02/10 14:45:57 UTC

[jira] Created: (TIKA-595) HtmlHandler does not support multivalue metadata

HtmlHandler does not support multivalue metadata
------------------------------------------------

                 Key: TIKA-595
                 URL: https://issues.apache.org/jira/browse/TIKA-595
             Project: Tika
          Issue Type: Bug
          Components: metadata, parser
    Affects Versions: 0.8
            Reporter: Lutz
            Priority: Minor


The HtmlParser uses metadata.set(...). So META tags that occure more than once are not handled correctly (DublinCore metadata can be set more than once).
The parser should use  metadata.add(..) instead.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-595) HtmlHandler does not support multivalue metadata

Posted by "Michael Kilgore (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13437057#comment-13437057 ] 

Michael Kilgore commented on TIKA-595:
--------------------------------------

Is there a work around for this limitation? I assume there is given the Minor priority that has been assigned. I have multiple metadata tags similar to DublinCore, and the use of metadata.set is not working.

Thanks,
                
> HtmlHandler does not support multivalue metadata
> ------------------------------------------------
>
>                 Key: TIKA-595
>                 URL: https://issues.apache.org/jira/browse/TIKA-595
>             Project: Tika
>          Issue Type: Bug
>          Components: metadata, parser
>    Affects Versions: 0.8
>            Reporter: Lutz Pumpenmeier
>            Priority: Minor
>
> The HtmlHandler uses metadata.set(...). So META tags that occure more than once are not handled correctly (DublinCore metadata can be set more than once).
> The handler should use  metadata.add(..) instead.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira