You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2011/07/02 00:30:05 UTC

[Nutch Wiki] Trivial Update of "bin/nutch_fetch" by LewisJohnMcgibbney

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "bin/nutch_fetch" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/bin/nutch_fetch?action=diff&rev1=5&rev2=6

Comment:
trivial update

  
  '''[-threads n]''': This arguement invokes the number of threads we wish to work concurrently on fetching URLs in the desired segment e.g. the number of fetcher threads the fetcher should use. This is also determines the maximum number of requests that are made at once (each fetcher thread handles one connection).
  
- '''[-noParsing]''': If no arguement is passed this value is the default. This is the case due to errors which can occur when parsing segments. If errors occur then the results of the whole fetching process can be corrupted. Note that parsing will only follow meta-redirects coming from the original URL.
+ '''[-noParsing]''': If no arguement is passed this value is the default, as set in nutch-default.xml. This is the case due to errors which can occur when parsing segments. If parsing errors occur then the results of the whole fetching process can be corrupted. Note that parsing will only follow meta-redirects coming from the original URL.
  
  CommandLineOptions