You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Craig Stadler <cs...@hotmail.com> on 2008/11/04 00:52:09 UTC

Question about score...

We have one field that is a simple text field, not multivalue.

<field name="content0"   type="text"    indexed="true" stored="false" multiValued="false"  />
 <defaultSearchField>content0</defaultSearchField>
 <solrQueryParser defaultOperator="OR"/>

We are populating music, atrist song etc in one string.

content0:(blues brothers)







Returns : (default desc score)
 
BluesBrothers01.mp3
Breaux_Brothers_Tiger_Rag_Blues.mp3
Blues Brothers - Theme From Rawhide V1.mp3 

Why in the world is result 2 higher in score than #3 ???
Is there something we can set in our schema or sol config to change this..
Ideally we want all the Blues Brothers to appear with higher score because of word order and proximity to the beginning of the string, etc.

Many thanks and appreciation to anyone who can shed light on this.

-Craig

_________________________________________________________________
When your life is on the go—take your life with you.
http://clk.atdmt.com/MRT/go/115298558/direct/01/

Re: Question about score...

Posted by Otis Gospodnetic <ot...@yahoo.com>.
Hi,

You could look at the scoring explanation with &debugQuery=true, and I think you'd see that this is because of the TF (term frequency) for terms blues and brothers.  You can think/visualize this as "two for two" for that first hit - the field has 2 terms and both of them match your search terms 2 of 2 terms match, and that looks good to the scoring engine.  The other docs/fields have a smaller percentage of words matching.

Otis--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Craig Stadler <cs...@hotmail.com>
> To: solr-user@lucene.apache.org
> Sent: Monday, November 3, 2008 6:52:09 PM
> Subject: Question about score...
> 
> 
> We have one field that is a simple text field, not multivalue.
> 
> 
> multiValued="false"  />
> content0
> 
> 
> We are populating music, atrist song etc in one string.
> 
> content0:(blues brothers)
> 
> 
> 
> 
> 
> 
> 
> Returns : (default desc score)
> 
> BluesBrothers01.mp3
> Breaux_Brothers_Tiger_Rag_Blues.mp3
> Blues Brothers - Theme From Rawhide V1.mp3 
> 
> Why in the world is result 2 higher in score than #3 ???
> Is there something we can set in our schema or sol config to change this..
> Ideally we want all the Blues Brothers to appear with higher score because of 
> word order and proximity to the beginning of the string, etc.
> 
> Many thanks and appreciation to anyone who can shed light on this.
> 
> -Craig
> 
> _________________________________________________________________
> When your life is on the go—take your life with you.
> http://clk.atdmt.com/MRT/go/115298558/direct/01/


Re: Question about score...

Posted by Yonik Seeley <yo...@apache.org>.
On Mon, Nov 3, 2008 at 6:52 PM, Craig Stadler <cs...@hotmail.com> wrote:
> BluesBrothers01.mp3
> Breaux_Brothers_Tiger_Rag_Blues.mp3
> Blues Brothers - Theme From Rawhide V1.mp3
>
> Why in the world is result 2 higher in score than #3 ???
> Is there something we can set in our schema or sol config to change this..
> Ideally we want all the Blues Brothers to appear with higher score because of word order and proximity to the beginning of the string, etc.

Boolean queries have no implicit proximity... you either need to add
it yourself or use a query parser like dismax that can add it for you.
 If you want to add it yourself, use something like "blues
brothers"~100

-Yonik