You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2012/06/12 12:57:52 UTC
[Nutch Wiki] Update of "bin/nutch solrindex" by MarkusJelsma
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "bin/nutch solrindex" page has been changed by MarkusJelsma:
http://wiki.apache.org/nutch/bin/nutch%20solrindex?action=diff&rev1=4&rev2=5
Usage:
{{{
- bin/nutch solrindex <solr url> <crawldb> -linkdb <linkdb> (<segment> ... | -dir <segments>)
+ bin/nutch solrindex <solr url> <crawldb> [-linkdb <linkdb>] [-params k1=v1&k2=v2...] (<segment> ... | -dir <segments>) [-noCommit] [-deleteGone] [-filter] [-normalize]
}}}
'''<solr url>''': This is the HTTP solr instance you wish to index data with. e.g. ''http://localhost:8983/solr''
@@ -25, +25 @@
'''[-deleteGone]''': Delete the gone pages and permanent redirects of the input segment(s).
+ '''[-filter]''': Enable URL filtering.
+
+ '''[-normalize]''': Enable URL normalizing.
CommandLineOptions