Posted to dev@lucene.apache.org by "Jeroen Steggink (JIRA)" <ji...@apache.org> on 2015/05/20 22:17:59 UTC

[jira] [Updated] (LUCENE-6493) MoreLikeThis minimum should match query

     [ https://issues.apache.org/jira/browse/LUCENE-6493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jeroen Steggink updated LUCENE-6493:
------------------------------------
    Attachment: LUCENE-6493.patch

> MoreLikeThis minimum should match query
> ---------------------------------------
>
>                 Key: LUCENE-6493
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6493
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/search
>    Affects Versions: 5.x
>            Reporter: Jeroen Steggink
>            Priority: Minor
>         Attachments: LUCENE-6493.patch
>
>
> When performing MoreLikeThis queries, we have many parameters at hand to fine-tune the results. However, one thing I personally miss is the ability to set a minimum number of matching terms.
> The advantage of this parameter is that Lucene retrieves only the documents that have at least the given number of matching terms, so you won't get results with just one or two keyword matches when you are looking for documents that share more than a few interesting keywords.
> An example:
> doc1: lucene
> doc2: lucene release
> doc3: apache
> doc4: apache lucene
> doc5: apache lucene release
> Let's say the query is "apache lucene release" and you set the minimum number of matching interesting terms to 2.
> MoreLikeThis will return:
> doc2, doc4 and doc5
> If you set it to 3, it would return only doc5.
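
The example in the description can be reproduced with a small sketch in plain Java. It does not use Lucene and is not the attached patch: the class name MinShouldMatchSketch and the method matching are hypothetical, documents are modeled as whitespace-separated strings, and every query term is assumed to count as an "interesting" term. The real MoreLikeThis selects interesting terms by scoring them against an index, which this sketch leaves out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; not part of Lucene or of LUCENE-6493.patch.
public class MinShouldMatchSketch {

    // Returns the ids of the documents that share at least minMatch
    // distinct terms with the query, in the order the documents were added.
    public static List<String> matching(Map<String, String> docs, String query, int minMatch) {
        Set<String> queryTerms = new HashSet<>(Arrays.asList(query.toLowerCase().split("\\s+")));
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, String> doc : docs.entrySet()) {
            Set<String> shared = new HashSet<>(Arrays.asList(doc.getValue().toLowerCase().split("\\s+")));
            shared.retainAll(queryTerms); // keep only terms that also occur in the query
            if (shared.size() >= minMatch) {
                hits.add(doc.getKey());
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // The five documents from the issue description.
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("doc1", "lucene");
        docs.put("doc2", "lucene release");
        docs.put("doc3", "apache");
        docs.put("doc4", "apache lucene");
        docs.put("doc5", "apache lucene release");

        System.out.println(matching(docs, "apache lucene release", 2)); // [doc2, doc4, doc5]
        System.out.println(matching(docs, "apache lucene release", 3)); // [doc5]
    }
}
```

With a minimum of 1, all five documents would be returned, which is the situation the proposed parameter is meant to avoid.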


