Posted to solr-user@lucene.apache.org by "Rosa (Anuncios)" <ro...@gmail.com> on 2011/01/27 12:32:29 UTC

DIH and duplicate content

Hi,

Is there a way to avoid duplicate content in an index at the moment I'm 
uploading my XML feed via DIH?

I would like to have only one entry for a given description. I mean, if 
the description of a product already exists in the index, the new product 
should not be imported.

Is there a built-in function? Or any hack?

Thanks for your help

Rosa


Re: DIH and duplicate content

Posted by Markus Jelsma <ma...@openindex.io>.
http://wiki.apache.org/solr/Deduplication
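
For readers who want a rough picture of the "hack" route, here is a minimal sketch (not taken from the wiki page) that filters the feed before DIH ever sees it. It computes a hash signature over each product's description and keeps only the first product per signature, which is the same general idea as signature-based deduplication. The element names `product` and `description`, and both function names, are assumptions made up for illustration; the real feed layout is not shown in this thread.

```python
import hashlib
import xml.etree.ElementTree as ET


def description_signature(text):
    """Hash a description after lowercasing and collapsing whitespace."""
    normalized = " ".join(text.lower().split())
    return hashlib.md5(normalized.encode("utf-8")).hexdigest()


def drop_duplicate_descriptions(feed_xml):
    """Return the feed XML with only the first product per description kept.

    Assumes a hypothetical layout: <product> elements directly under the
    root, each with a <description> child.
    """
    root = ET.fromstring(feed_xml)
    seen = set()
    # Iterate over a copy of the child list because we remove while looping.
    for product in list(root.findall("product")):
        sig = description_signature(product.findtext("description", default=""))
        if sig in seen:
            root.remove(product)  # a product with this description was already kept
        else:
            seen.add(sig)
    return ET.tostring(root, encoding="unicode")
```

Note that this only removes duplicates inside a single feed file. It does not check descriptions that are already in the index, so it is a pre-processing step and not a replacement for deduplication on the Solr side.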


On Thursday 27 January 2011 12:32:29 Rosa (Anuncios) wrote:
> Is there a way to avoid duplicate content in an index at the moment I'm 
> uploading my XML feed via DIH?
> 
> I would like to have only one entry for a given description. I mean, if 
> the description of a product already exists in the index, the new product 
> should not be imported.

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350