You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Паша Минченков <ch...@gmail.com> on 2010/05/31 22:15:40 UTC

Grouping or de-duping

Sorry for my similar questions. I need to remove duplicates from search
results for a given field (or group by). Documents on this field are not
ordered. Which one will get duplicates in search results - I do not care. I
tried to use DuplicateFilter and PerParentLimitedQuery, but they didn't
help. In searching for an answer I found references to
SimpleFacetParameters, but I do not understand how this material can be
useful to me because it refers to the project Solr. Maybe someone has an
example of grouping searh result or something like DeDupinQuery.

On the link below, I found a solution, but there is no sample and I can't
make these modifications my self.
http://markmail.org/message/uvrh3y5ogjgu4gfx#query:group%20lucene%20results%20by%20field+page:1+mid:uvrh3y5ogjgu4gfx+state:results

Thanks.

Re: Grouping or de-duping

Posted by Otis Gospodnetic <ot...@yahoo.com>.
Pasa,

Maybe Field Collapsing (Solr) can help? See SOLR-236 in JIRA

http://search-lucene.com/?q=field+collapsing&fc_project=Lucene&fc_project=Solr

 Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



----- Original Message ----
> From: Паша Минченков <ch...@gmail.com>
> To: java-user@lucene.apache.org
> Sent: Mon, May 31, 2010 4:15:40 PM
> Subject: Grouping or de-duping
> 
> Sorry for my similar questions. I need to remove duplicates from 
> search
results for a given field (or group by). Documents on this field are 
> not
ordered. Which one will get duplicates in search results - I do not care. 
> I
tried to use DuplicateFilter and PerParentLimitedQuery, but they 
> didn't
help. In searching for an answer I found references 
> to
SimpleFacetParameters, but I do not understand how this material can 
> be
useful to me because it refers to the project Solr. Maybe someone has 
> an
example of grouping searh result or something like DeDupinQuery.

On 
> the link below, I found a solution, but there is no sample and I can't
make 
> these modifications my self.

> href="http://markmail.org/message/uvrh3y5ogjgu4gfx#query:group%20lucene%20results%20by%20field+page:1+mid:uvrh3y5ogjgu4gfx+state:results" 
> target=_blank 
> >http://markmail.org/message/uvrh3y5ogjgu4gfx#query:group%20lucene%20results%20by%20field+page:1+mid:uvrh3y5ogjgu4gfx+state:results

Thanks.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org