You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2005/07/17 01:13:57 UTC

[Nutch Wiki] Trivial Update of "bin/nutch generate" by RobPettengill

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by RobPettengill:
http://wiki.apache.org/nutch/bin/nutch_generate

------------------------------------------------------------------------------
- generate is an alias for net.nutch.tools.FetchListTool
+ generate is an alias for net.nutch.tools.!FetchListTool
  
  The generate command is used to create a new fetchlist from the webdb which contains urls which can be fetched using the fetch tool.
  
- Usage: bin/nutch net.nutch.tools.FetchListTool (-local | -ndfs <namenode:port>) 
+ Usage: bin/nutch net.nutch.tools.!FetchListTool (-local | -ndfs <namenode:port>) 
    <db> <segment_dir> [-refetchonly] [-anchoroptimize linkdb] [-topN N] 
    [-cutoff cutoffscore] [-numFetchers numFetchers] [-adddays numDays]
  
@@ -18, +18 @@

  urls with the highest score - potentially the most interesting ones,
  which should be prioritized in fetching.
  
+ CommandLineOptions
  
   - Juho Mäkinen