Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2005/07/17 01:13:57 UTC
[Nutch Wiki] Trivial Update of "bin/nutch generate" by RobPettengill
The following page has been changed by RobPettengill:
http://wiki.apache.org/nutch/bin/nutch_generate
------------------------------------------------------------------------------
- generate is an alias for net.nutch.tools.FetchListTool
+ generate is an alias for net.nutch.tools.!FetchListTool
The generate command is used to create a new fetchlist from the webdb which contains urls which can be fetched using the fetch tool.
- Usage: bin/nutch net.nutch.tools.FetchListTool (-local | -ndfs <namenode:port>)
+ Usage: bin/nutch net.nutch.tools.!FetchListTool (-local | -ndfs <namenode:port>)
<db> <segment_dir> [-refetchonly] [-anchoroptimize linkdb] [-topN N]
[-cutoff cutoffscore] [-numFetchers numFetchers] [-adddays numDays]
@@ -18, +18 @@
urls with the highest score - potentially the most interesting ones,
which should be prioritized in fetching.
+ CommandLineOptions
- Juho Mäkinen