You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2011/12/08 14:23:46 UTC
[Nutch Wiki] Trivial Update of "NutchTutorial" by LewisJohnMcgibbney
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "NutchTutorial" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/NutchTutorial?action=diff&rev1=54&rev2=55
* run the Solr Index command:
{{{
- bin/nutch solrindex http://127.0.0.1:8983/solr/ crawldb crawldb/linkdb crawldb/segments/*
+ bin/nutch solrindex http://127.0.0.1:8983/solr/ crawldb -linkdb crawldb/linkdb crawldb/segments/*
}}}
+
+ The call signature for running the solrindex has changed. The linkdb is now optional, so you need to denote it with a "-linkdb" flag on the command line.
+
This will send all crawl data to Solr for indexing. For more information please see [[bin/nutch solrindex]]
If all has gone to plan, we are now ready to search with http://localhost:8983/solr/admin/. If you want to see the raw HTML indexed by Solr, change the content field definition in `schema.xml` to: