You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/03/04 22:47:49 UTC

[Nutch Wiki] Update of "DevelopmentCommandLineOptions" by JeffRitchie

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by JeffRitchie:
http://wiki.apache.org/nutch/DevelopmentCommandLineOptions

The comment on the change is:
New Page - Removed Options no longer in nutch script.

New page:
= Command Line Options of nutch-0.8-dev/bin/nutch =

See each entry for datails of the command arguments and options.

||'''command'''||'''function'''||
||["nutch-0.8-dev/bin/nutch crawl"]||One-step crawler for intranets||
||["nutch-0.8-dev/bin/nutch readdb"]||Read / dump crawldb||
||["nutch-0.8-dev/bin/nutch readlinkdb"]||Read / dump linkdb||
||["nutch-0.8-dev/bin/nutch inject"]||Inject new urls into the crawldb||
||["nutch-0.8-dev/bin/nutch generate"]||Generate new segments to fetch||
||["nutch-0.8-dev/bin/nutch fetch"]||Fetch a segment's pages||
||["nutch-0.8-dev/bin/nutch parse"]||Parse contents in one segment||
||["nutch-0.8-dev/bin/nutch segread"]||Read data in an existing segment||
||["nutch-0.8-dev/bin/nutch updatedb"]||Updates the crawldb from a segment||
||["nutch-0.8-dev/bin/nutch invertlinks"]||Create or update a linkdb from a segment or segments||
||["nutch-0.8-dev/bin/nutch index"]||Run the indexer on a segment's fetcher output||
||["nutch-0.8-dev/bin/nutch merge"]||Merge several segment indexes||
||["nutch-0.8-dev/bin/nutch dedup"]||Deletes duplicate documents in a set of segment indexes||
||["nutch-0.8-dev/bin/nutch plugin"]||Load a plugin and run one of its classes main()||
||["nutch-0.8-dev/bin/nutch server"]||Run a search server||
||                          ||                                               ||