Posted to user@nutch.apache.org by Des Sant <sa...@gmail.com> on 2007/06/22 17:30:12 UTC

slow distributed crawling

Hello,

I tried to start a crawl on a single machine, but with a distributed
configuration (the single machine acts as master and slave at the same time).
The server communicates with itself through ssh.
It works and it crawls, but performance is very bad, much slower than
a local crawl on the same machine. Is this due to the Hadoop overhead,
or did I do something wrong?


Thanks for your help.