
Posted to java-user@lucene.apache.org by Jeff Wallace <jl...@us.ibm.com> on 2017/07/18 16:09:26 UTC

Unexpected scoring results

On a legacy product that is still based upon Lucene-3.6.2, we (or our 
customers) occasionally encounter a situation like this:
For whatever reason, a customer causes more than one copy of the same 
source document to be ingested into the same index.
A subsequent query whose criteria select these duplicate documents can 
sometimes report score values that differ considerably for the supposedly 
duplicate content.

Searching through some of the older Lucene mail archives, I did notice what 
I believe to be discussions of development test failures having to do with 
unexpected scoring results at past points in time.

Anyway, we do hope to upgrade soon to a newer version of Lucene (how new 
will depend upon our ability to provide re-indexing capability for existing 
customers' v3.6.2 indexes).
My question is: is it likely that these occasional scoring aberrations have 
been fixed and/or reduced in later versions (say 5.x or 6.x)?

Thank you for any info. 

Jeff Wallace
Software Development, FileNet
IBM Corp.
1540 Scenic Ave.
Costa Mesa, CA 92626
(714) 327-7163 direct


