You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Julie Tibshirani (JIRA)" <ji...@apache.org> on 2014/07/03 06:03:25 UTC

[jira] [Updated] (LUCENE-5363) Inconsistent behavior for phrase queries with leading stopwords

     [ https://issues.apache.org/jira/browse/LUCENE-5363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julie Tibshirani updated LUCENE-5363:
-------------------------------------

    Description: 
Phrase queries with leading stopwords exhibit inconsistent behavior. For example, the phrase query "the president of the united states" matches "hello president of the united states", but not "president of the united states".

As far as I can tell, this issue arose in 4.0 when ExactPhraseScorer was re-implemented. In 3.0, "the president of the united states" matches both "hello president of the united states" and "president of the united states".

A test case that triggers the bug is attached.

  was:
Phrase queries with leading stopwords exhibit inconsistent behavior. For example, the phrase query "the president of the united states" matches "hello president of the united states", but not "president of the united states".

As far as I can tell, this issue arose in 4.0 when ExactPhraseScorer was re-implemented. In 3.0, "the president of the united states" matches both "hello of the united states" and "president of the united states".

A test case that triggers the bug is attached.


> Inconsistent behavior for phrase queries with leading stopwords 
> ----------------------------------------------------------------
>
>                 Key: LUCENE-5363
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5363
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/search
>    Affects Versions: 4.0
>            Reporter: Julie Tibshirani
>         Attachments: PhraseQueryWithLeadingStopwordsTest.java
>
>
> Phrase queries with leading stopwords exhibit inconsistent behavior. For example, the phrase query "the president of the united states" matches "hello president of the united states", but not "president of the united states".
> As far as I can tell, this issue arose in 4.0 when ExactPhraseScorer was re-implemented. In 3.0, "the president of the united states" matches both "hello president of the united states" and "president of the united states".
> A test case that triggers the bug is attached.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org