You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by patrik <pa...@clipblast.com> on 2007/06/19 09:37:32 UTC

Different config files for different jobs

I've been running multiple standalone machines with some different
config files and very different crawldb for a while now. However I'd
like to start running them all distributed over using the same cluster.
Do configuration files, specifically nutch-site.xml *-urlfilter.txt get
read at the beginning of a job on all machines? For things like generate
which immediately run partion afterwards, does the partition job pick up
the same config as the generate job, or are they read again from the
filesystem?


patrik