Posted to general@lucene.apache.org by john atsh <jo...@gmail.com> on 2009/03/23 13:46:59 UTC

Similarity

I want to slightly change the similarity function, as follows:

Use the same cosine similarity as defined by DefaultSimilarity, but multiply the
resulting score by f, where f is defined as:

f = (# of query terms that also appear in the document) / (# of terms in the document)

(This boosts documents that contain RELATIVELY more query terms.)
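
To make the definition of f concrete, here is a minimal sketch in plain Java, outside of Lucene. The names TermOverlapFactor, overlapFactor and adjustedScore are hypothetical and not part of any Lucene API. It assumes each term is counted once (distinct terms) and that an empty document gets f = 0; neither choice is fixed by the definition above.

```java
import java.util.Collection;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical helper illustrating the factor f; not part of Lucene. */
final class TermOverlapFactor {

    private TermOverlapFactor() {}

    /**
     * f = (# of distinct query terms that also appear in the document)
     *     / (# of distinct terms in the document).
     * Assumption: repeated terms are counted once.
     */
    static double overlapFactor(Collection<String> queryTerms,
                                Collection<String> docTerms) {
        Set<String> docSet = new HashSet<>(docTerms);
        if (docSet.isEmpty()) {
            return 0.0; // assumption: avoid division by zero for empty documents
        }
        // Keep only the query terms that also occur in the document.
        Set<String> matched = new HashSet<>(queryTerms);
        matched.retainAll(docSet);
        return (double) matched.size() / docSet.size();
    }

    /** Multiplies an already computed similarity score by f. */
    static double adjustedScore(double baseScore,
                                Collection<String> queryTerms,
                                Collection<String> docTerms) {
        return baseScore * overlapFactor(queryTerms, docTerms);
    }
}
```

For example, with query terms {lucene, similarity, score} and a document containing {lucene, score, index, search}, two of the four document terms match, so f = 2/4 = 0.5 and the original score is halved. The sketch only pins down the arithmetic; it does not show how to hook the factor into Lucene's scoring.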

What's the best way to do it, and how?

Thanks
John