You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/03/05 00:11:43 UTC

[Nutch Wiki] Update of "nutch-0.8-dev/bin/nutch readdb" by JeffRitchie

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by JeffRitchie:
http://wiki.apache.org/nutch/nutch-0%2e8-dev/bin/nutch_readdb

New page:
= "readdb" is an alias for "org.apache.nutch.crawl.CrawlDbReader" =

== Returns or Exports information on the Crawl Database ==

=== Usage ===
 nutch-0.8-dev/bin/nutch org.apache.nutch.crawl.CrawlDbReader <crawldb> (-stats | -dump <out_dir> | -url <url>)

  '''<crawldb>:''' Path to the crawldb directory.  Default Value: ''None''[[BR]]
  '''[-stats]:''' Prints the overall statistics to System.out[[BR]]
  '''[-dump <out_dir>]:''' Exports the crawldb to a file in <out_dir>[[BR]]
  '''[-url <url>]:''' Prints statistics on <url> to System.out[[BR]]

=== Configuration Files ===
 hadoop-default.xml[[BR]]
 hadoop-site.xml[[BR]]

=== Other Files ===
 None.

=== Caveats and Notes ===
 None.

DevelopmentCommandLineOptions