You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2005/07/17 01:54:55 UTC

[Nutch Wiki] Update of "bin/nutch datanode" by RobPettengill

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by RobPettengill:
http://wiki.apache.org/nutch/bin/nutch_datanode

New page:
datanode is an alias for net.nutch.ndfs.NDFS

The NDFS class holds the NDFS client and server.

DataNode controls just one critical table: block-> BLOCK_SIZE stream of bytes

This info is stored on disk (the NameNode is responsible for asking other machines to replicate the data). The DataNode reports the table's contents to the NameNode upon startup and every so often afterwards.

Usage: bin/nutch net.nutch.ndfs.NDFS <dataDir> <localMachine> <namenode:port>

[CommandLineOptions]