You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2018/10/15 13:34:36 UTC

[Nutch Wiki] Update of "NutchTutorial" by SebastianNagel

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "NutchTutorial" page has been changed by SebastianNagel:
https://wiki.apache.org/nutch/NutchTutorial?action=diff&rev1=93&rev2=94

Comment:
NUTCH-2656 Solr setup updated for Solr 7.x

  || 1.13  || 5.5.0  ||
  || 1.12  || 5.4.1  ||
  
- To install Solr:
+ To install Solr 7.x:
   * download binary file from [[http://www.apache.org/dyn/closer.cgi/lucene/solr/|here]]
   * unzip to `$HOME/apache-solr`, we will now refer to this as `${APACHE_SOLR_HOME}`
-  * create resources for a new nutch solr core `cp -r ${APACHE_SOLR_HOME}/server/solr/configsets/basic_configs ${APACHE_SOLR_HOME}/server/solr/configsets/nutch`
+  * create resources for a new nutch solr core {{{
+ mkdir -p ${APACHE_SOLR_HOME}/server/solr/configsets/nutch/
+ cp -r ${APACHE_SOLR_HOME}/server/solr/configsets/_default/* ${APACHE_SOLR_HOME}/server/solr/configsets/nutch/
+ }}}
+  * copy the nutch schema.xml into the `conf` directory {{{
-  * copy the nutch schema.xml into the `conf` directory `cp ${NUTCH_RUNTIME_HOME}/conf/schema.xml ${APACHE_SOLR_HOME}/server/solr/configsets/nutch/conf`
+ cp ${NUTCH_RUNTIME_HOME}/conf/schema.xml ${APACHE_SOLR_HOME}/server/solr/configsets/nutch/conf/
-  * make sure that there is no `managed-schema` "in the way": `rm ${APACHE_SOLR_HOME}/server/solr/configsets/nutch/conf/managed-schema`
-  * start the solr server `${APACHE_SOLR_HOME}/bin/solr start`
+ }}}
+ You may try to use the most recent [[https://github.com/apache/nutch/blob/master/conf/schema.xml|schema.xml]] in case of issues launching Solr with this schema.
+  * make sure that there is no [[https://lucene.apache.org/solr/guide/7_5/schema-factory-definition-in-solrconfig.html#SchemaFactoryDefinitioninSolrConfig-SolrUsesManagedSchemabyDefault|managed-schema]] "in the way": {{{
+ rm ${APACHE_SOLR_HOME}/server/solr/configsets/nutch/conf/managed-schema
+ }}}
+  * start the solr server {{{
+ ${APACHE_SOLR_HOME}/bin/solr start
+ }}}
+  * create the nutch core {{{
-  * create the nutch core `${APACHE_SOLR_HOME}/bin/solr create -c nutch -d server/solr/configsets/nutch/conf/`
+ ${APACHE_SOLR_HOME}/bin/solr create -c nutch -d ${APACHE_SOLR_HOME}/server/solr/configsets/nutch/conf/
+ }}}
  
  After that you need to point Nutch to the Solr instance:
   * (Nutch 1.15 and later) edit the file `conf/index-writers.xml`, see IndexWriters