Posted to user@nutch.apache.org by Patrick Markiewicz <pm...@sim-gtech.com> on 2008/07/14 23:18:52 UTC
Dedup Details
Hi,
I was curious which file the dedup process actually keeps when it
finds duplicates. Is the choice predictable? Is there a way to give
preference to a specific domain? Thanks.
Patrick