Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/04/13 21:36:06 UTC

[jira] [Resolved] (NUTCH-982) Remove copying of ID and URL field in solrmapping

     [ https://issues.apache.org/jira/browse/NUTCH-982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma resolved NUTCH-982.
---------------------------------

    Resolution: Fixed
      Assignee: Markus Jelsma

Committed for trunk in rev 1091895.

> Remove copying of ID and URL field in solrmapping
> -------------------------------------------------
>
>                 Key: NUTCH-982
>                 URL: https://issues.apache.org/jira/browse/NUTCH-982
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 2.0
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 2.0
>
>         Attachments: NUTCH-982.patch
>
>
> Guys, the Solr indexer seems to be broken in trunk. With the current solrmapping configuration and code, you'll get an exception complaining about multiple values in a non-multivalued field: the ID field, which must of course be single-valued. This happens because the current mapping code and mapping configuration copy the url and id fields. The old 1.3 NutchDocument does not contain an ID field, but the one in trunk does.
> I propose to change the current solrmapping configuration by simply removing:
>                 <field dest="id" source="url"/>
>                 <copyField source="url" dest="url"/>
> If not, we need to do something about the solrmapping code.
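
The failure described above can be illustrated with a small simulation. This is only a sketch in Python, not Nutch code: the function names, the dictionary-based document, and the way the rules are applied (a field rule renames the source key, a copyField rule adds the value to a second key) are assumptions made for illustration. It shows how a document that already carries an id ends up with two id values once the url-to-id rule is applied, and how removing both rules avoids that.

```python
from collections import defaultdict

def map_document(doc, field_map, copy_map):
    """Apply hypothetical mapping rules to a document of {field: [values]}.

    field_map renames a source field to a destination field;
    copy_map additionally copies a source field into another field.
    This models the behaviour described in the issue, not the real Nutch code.
    """
    out = defaultdict(list)
    for name, values in doc.items():
        for value in values:
            # Unmapped fields keep their own name.
            out[field_map.get(name, name)].append(value)
            if name in copy_map:
                out[copy_map[name]].append(value)
    return dict(out)

def multivalued(mapped):
    """Return the names of fields that ended up with more than one value."""
    return sorted(name for name, values in mapped.items() if len(values) > 1)

# A trunk-style document that already contains both id and url (example values).
doc = {"id": ["http://example.org/"], "url": ["http://example.org/"]}

# With <field dest="id" source="url"/> and <copyField source="url" dest="url"/>
before = map_document(doc, {"url": "id"}, {"url": "url"})

# With both rules removed, as proposed in the issue
after = map_document(doc, {}, {})

print(multivalued(before))
print(multivalued(after))
```

In this model the id field receives the document's own id plus the renamed url, which is the kind of duplicate a single-valued Solr field would reject; with the two rules removed, every field keeps exactly one value.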
