You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/04/13 14:49:05 UTC

[jira] [Commented] (NUTCH-982) Remove copying of ID and URL field in solrmapping

    [ https://issues.apache.org/jira/browse/NUTCH-982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019299#comment-13019299 ] 

Markus Jelsma commented on NUTCH-982:
-------------------------------------

If there are no objection i'll commit this today. 

> Remove copying of ID and URL field in solrmapping
> -------------------------------------------------
>
>                 Key: NUTCH-982
>                 URL: https://issues.apache.org/jira/browse/NUTCH-982
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 2.0
>            Reporter: Markus Jelsma
>             Fix For: 2.0
>
>         Attachments: NUTCH-982.patch
>
>
> Guys, the Solrindexer seems to be broken in trunk. With current solrmapping and code you'll get an exception complaining about multiple values in a non-multivalued field; the ID field which must of course be single valued. This happens because of the current mapping code and mapping config copy the url and id fields. The old 1.3 NutchDocument does not contain an ID field but in trunk it does.
> I propose to change the current solrmapping configuration by simply removing:
>                 <field dest="id" source="url"/>
>                 <copyField source="url" dest="url"/>
> If not, we need to do something about the solrmapping code.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira