You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/11/25 22:00:39 UTC

[jira] Commented: (NUTCH-760) Allow field mapping from nutch to solr index

    [ https://issues.apache.org/jira/browse/NUTCH-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782617#action_12782617 ] 

Andrzej Bialecki  commented on NUTCH-760:
-----------------------------------------

I reworked the patch to get rid of any left-overs of static Configuration, and changed the concept of "schema" (which was misleading) to "mapping" throughout the patch and class names.

This is now committed in rev. 884269 - thanks!

> Allow field mapping from nutch to solr index
> --------------------------------------------
>
>                 Key: NUTCH-760
>                 URL: https://issues.apache.org/jira/browse/NUTCH-760
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: David Stuart
>            Assignee: Andrzej Bialecki 
>             Fix For: 1.1
>
>         Attachments: solrindex_schema.patch, solrindex_schema.patch, solrindex_schema.patch, solrindex_schema.patch
>
>
> I am using nutch to crawl sites and have combined it
> with solr pushing the nutch index using the solrindex command. I have
> set it up as specified on the wiki using the copyField url to id in the
> schema. Whilst this works fine it is stuff's up my inputs from other
> sources in solr (e.g. using the solr data import handler) as they have
> both id's and url's. I have patch that implements a nutch xml schema
> defining what basic nutch fields map to in your solr push.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.