Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2006/03/05 00:11:43 UTC
[Nutch Wiki] Update of "nutch-0.8-dev/bin/nutch readdb" by JeffRitchie
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by JeffRitchie:
http://wiki.apache.org/nutch/nutch-0%2e8-dev/bin/nutch_readdb
New page:
= "readdb" is an alias for "org.apache.nutch.crawl.CrawlDbReader" =
== Returns or Exports information on the Crawl Database ==
=== Usage ===
nutch-0.8-dev/bin/nutch org.apache.nutch.crawl.CrawlDbReader <crawldb> (-stats | -dump <out_dir> | -url <url>)
'''<crawldb>:''' Path to the crawldb directory. Default Value: ''None''[[BR]]
'''-stats:''' Prints overall statistics for the crawldb to System.out.[[BR]]
'''-dump <out_dir>:''' Exports the contents of the crawldb to a file in <out_dir>.[[BR]]
'''-url <url>:''' Prints statistics on <url> to System.out.[[BR]]
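The three modes in the usage line can be sketched as a small shell script. This is only an illustration under stated assumptions: the crawldb path `crawl/crawldb`, the dump directory `crawl/crawldb-dump`, and the URL `http://www.example.com/` are hypothetical placeholders, and the `run` helper is not part of Nutch. It prints each command and executes it only if the launcher script is actually present.

```shell
#!/bin/sh
# Hypothetical locations: adjust to your installation and crawl directory.
NUTCH="${NUTCH:-nutch-0.8-dev/bin/nutch}"
CRAWLDB="${CRAWLDB:-crawl/crawldb}"   # assumed crawldb path
READER="org.apache.nutch.crawl.CrawlDbReader"

# Illustrative helper (not part of Nutch): print the command,
# then run it only if the nutch launcher exists and is executable.
run() {
  echo "+ $*"
  if [ -x "$NUTCH" ]; then "$@"; fi
}

# Overall statistics for the crawldb, written to System.out.
run "$NUTCH" "$READER" "$CRAWLDB" -stats

# Export the crawldb to a file in the given output directory (placeholder name).
run "$NUTCH" "$READER" "$CRAWLDB" -dump crawl/crawldb-dump

# Statistics for a single URL (placeholder URL).
run "$NUTCH" "$READER" "$CRAWLDB" -url http://www.example.com/
```

Since the page title describes "readdb" as an alias for this class, setting `READER=readdb` in the script should give the shorter form of the same commands.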
=== Configuration Files ===
hadoop-default.xml[[BR]]
hadoop-site.xml[[BR]]
=== Other Files ===
None.
=== Caveats and Notes ===
None.
DevelopmentCommandLineOptions