You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Jörg Agatz <jo...@googlemail.com> on 2009/06/05 09:13:55 UTC

Only the Newes File as Result

Hallo, Solr users...

I have a Problem!

I Have a lot of files, fome of the Files are exist in more than one version.
often they are only little changes in the files...

Now i musst find a way to get only the last of each file.
The normal Results are maby 500 Documents, but from each document are exist
2 or 3 revisions, nuw i hop to find with your help a way to get, only the
300 - 200 newes Document!


Maby you have an idea for me.

Jörg

Re: Only the Newes File as Result

Posted by Grant Ingersoll <gs...@apache.org>.
On Jun 5, 2009, at 12:13 AM, Jörg Agatz wrote:

> Hallo, Solr users...
>
> I have a Problem!
>
> I Have a lot of files, fome of the Files are exist in more than one  
> version.
> often they are only little changes in the files...
>
> Now i musst find a way to get only the last of each file.
> The normal Results are maby 500 Documents, but from each document  
> are exist
> 2 or 3 revisions, nuw i hop to find with your help a way to get,  
> only the
> 300 - 200 newes Document!
>

Do you ever have a requirement to get at the old revisions?  Because  
if you don't need the old revisions _ever_ then just make sure, during  
indexing, that they file has the same Unique Id and then Solr will  
replace it.

Otherwise, you can look at the field collapsing work going on in JIRA  
(SOLR-236) and also looking at using a Function Query that boosts by  
relevancy.  The latter approach will just help make sure the newer  
results appear on top, but won't exclude the older revisions.

HTH,
Grant

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
using Solr/Lucene:
http://www.lucidimagination.com/search