Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2005/07/17 02:05:52 UTC

[Nutch Wiki] Update of "bin/nutch inject" by RobPettengill

The following page has been changed by RobPettengill:
http://wiki.apache.org/nutch/bin/nutch_inject

New page:
inject is an alias for net.nutch.db.!WebDBInjector

This class takes a flat file of URLs and adds each one as an entry to a web page and link database. It is useful for bootstrapping the system.

Usage: bin/nutch net.nutch.db.!WebDBInjector (-local | -ndfs <namenode:port>) <db_dir> (-urlfile <url_file> | -dmozfile <dmoz_file>) [-subset <subsetDenominator>] [-includeAdultMaterial] [-skew skew] [-noDmozDesc] [-topicFile <topic list file>] [-topic <topic> [-topic <topic> [...]]]
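
As a rough illustration of the usage line above, the following sketch writes a small URL file and assembles an inject call for the local filesystem. The file name urls.txt, the directory name db, and the two seed URLs are hypothetical placeholders, and the sketch assumes the db directory has already been set up. It only runs the command when bin/nutch is present in the current directory; otherwise it prints the command.

```shell
# Hypothetical names: adjust the URL file and db directory to your setup.
URL_FILE=urls.txt
DB_DIR=db

# The flat file holds one URL per line (placeholder seed URLs).
printf '%s\n' 'http://lucene.apache.org/nutch/' 'http://www.apache.org/' > "$URL_FILE"

# Assemble the call following the usage line: filesystem flag, db directory, URL file.
INJECT_CMD="bin/nutch inject -local $DB_DIR -urlfile $URL_FILE"

# Run the command only from inside a Nutch installation; otherwise just show it.
if [ -x bin/nutch ]; then
  $INJECT_CMD
else
  echo "$INJECT_CMD"
fi
```

Using the inject alias instead of the full class name keeps the command short; the two forms should be interchangeable given that inject is an alias for the class.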

[[CommandLineOptions]]