Posted to user@nutch.apache.org by Robert Young <bu...@gmail.com> on 2007/07/04 18:03:39 UTC

Problem merging Lucene index

Is it possible for me to dedup a Lucene index on a Hadoop filesystem
against a finished Lucene index?

I build up my index with Nutch as normal, but I would like to
inject single URLs and merge the result into the final index without
having to run a full crawl.

Cheers
Rob