You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by John Seer <pu...@yahoo.com> on 2009/07/07 20:31:10 UTC
Multi Value field
Hello,
I have 100k index with documents with one searchable field in it. That
field has multiple values for example
doc( search: X search : X Y search: X Y Z id:1)
doc( search: X Y K id:2)
I am using Standard Analyzer for building and searching, and having problem
with scores
if the term is "X Y" the document 2 is getting higher score then document 1.
Please can someone suggest way that doc 1 will score higher?
Maybe I need to use some specific query type?
Thanks
--
View this message in context: http://www.nabble.com/Multi-Value-field-tp24378975p24378975.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Multi Value field
Posted by Mark Harwood <ma...@yahoo.co.uk>.
> I just try norms idea as well no change
You'll need to look at searcher.explain() for the two docs or post a
Junit or code example that can be executed which shows the issue
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Multi Value field
Posted by John Seer <pu...@yahoo.com>.
I already tried to use custom similarity (I set all methods to return 1f)-
doesn't work.
I just try norms idea as well no change
markharw00d wrote:
>
>> if the term is "X Y" the document 2 is getting higher score then
>> document 1.
>
>
> That may be length normalisation at play. Doc 2 is shorter so may be
> seen as a better match for that reason.
> Using the "explain" function helps illustrate the break down of scores
> in matches.
>
> You could try indexing that field without "norms" or use a custom
> Similarity at query time to ignore the length normalisation factor.
>
> Cheers
> Mark
>
>
>
> On 7 Jul 2009, at 19:31, John Seer wrote:
>
>>
>> Hello,
>>
>> I have 100k index with documents with one searchable field in it.
>> That
>> field has multiple values for example
>>
>> doc( search: X search : X Y search: X Y Z id:1)
>> doc( search: X Y K id:2)
>>
>> I am using Standard Analyzer for building and searching, and having
>> problem
>> with scores
>>
>> if the term is "X Y" the document 2 is getting higher score then
>> document 1.
>>
>>
>> Please can someone suggest way that doc 1 will score higher?
>> Maybe I need to use some specific query type?
>>
>> Thanks
>>
>>
>>
>> --
>> View this message in context:
>> http://www.nabble.com/Multi-Value-field-tp24378975p24378975.html
>> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
>
--
View this message in context: http://www.nabble.com/Multi-Value-field-tp24378975p24381024.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Multi Value field
Posted by Mark Harwood <ma...@yahoo.co.uk>.
> if the term is "X Y" the document 2 is getting higher score then
> document 1.
That may be length normalisation at play. Doc 2 is shorter so may be
seen as a better match for that reason.
Using the "explain" function helps illustrate the break down of scores
in matches.
You could try indexing that field without "norms" or use a custom
Similarity at query time to ignore the length normalisation factor.
Cheers
Mark
On 7 Jul 2009, at 19:31, John Seer wrote:
>
> Hello,
>
> I have 100k index with documents with one searchable field in it.
> That
> field has multiple values for example
>
> doc( search: X search : X Y search: X Y Z id:1)
> doc( search: X Y K id:2)
>
> I am using Standard Analyzer for building and searching, and having
> problem
> with scores
>
> if the term is "X Y" the document 2 is getting higher score then
> document 1.
>
>
> Please can someone suggest way that doc 1 will score higher?
> Maybe I need to use some specific query type?
>
> Thanks
>
>
>
> --
> View this message in context: http://www.nabble.com/Multi-Value-field-tp24378975p24378975.html
> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org