You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Patrick Markiewicz <pm...@sim-gtech.com> on 2008/07/14 23:18:52 UTC

Dedup Details

Hi,

            I was curious as to what file the dedup process actually
saves?  Is it predictable?  Is there a way to give preference to a
specific domain?  Thanks.

 

Patrick