You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/03/07 00:20:20 UTC
[Nutch Wiki] Trivial Update of "nutch-0.8-dev/bin/nutch index" by JeffRitchie
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by JeffRitchie:
http://wiki.apache.org/nutch/nutch-0%2e8-dev/bin/nutch_index
The comment on the change is:
new page added
New page:
= "index" is an alias for "org.apache.nutch.indexer.Indexer" =
== Creates an index of a segment using information from the crawldb and the linkdb to score pages in the index. ==
=== Usage ===
nutch-0.8-dev/bin/nutch org.apache.nutch.indexer.Indexer <index> <crawldb> <linkdb> <segment> ...
'''<index>:''' Path to the directory where the index will be created.[[BR]]
'''<crawldb>:''' Path to the crawl database directory.[[BR]]
'''<linkdb>:''' Path to the link database directory.[[BR]]
'''<segment>:''' Path to the segment that has been fetched. More then one segment may be specified.[[BR]]
=== Configuration Files ===
hadoop-default.xml[[BR]]
hadoop-site.xml[[BR]]
nutch-default.xml[[BR]]
nutch-site.xml[[BR]]
=== Other Files ===
None.
=== Caveats and Notes ===
None.
DevelopmentCommandLineOptions