You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by "Håvard W. Kongsgård" <h....@niap.no> on 2006/04/15 13:28:22 UTC

How to run bin/nutch dedup when running multiple servers

Hi, I am running nutch 0.7.2 on 3 servers|1 tomcat/db|2 segment servers port 8081|
is it possible to run bin/nutch dedup on multiple servers so that nutch removes all duplicated pages?