You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by GitBox <gi...@apache.org> on 2022/06/01 21:17:46 UTC

[GitHub] [lucene] pminkov opened a new pull request, #939: Use similarity.tf() in MoreLikeThis

pminkov opened a new pull request, #939:
URL: https://github.com/apache/lucene/pull/939

   ### Description
   
   MoreLikeThis picks terms by their TF-IDF score. The TF part of the score was used by taking the term frequency directly, without applying a square root through ClassicSimilarity.tf(). The result of this is that how common a term is in an input can have too much weight on whether it's selected as a search term. This can make more stop words make their way into the final query.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org


[GitHub] [lucene] pminkov closed pull request #939: Use similarity.tf() in MoreLikeThis

Posted by GitBox <gi...@apache.org>.
pminkov closed pull request #939: Use similarity.tf() in MoreLikeThis
URL: https://github.com/apache/lucene/pull/939


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org