You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "mark risher (JIRA)" <ji...@apache.org> on 2011/05/11 23:45:47 UTC

[jira] [Created] (SOLR-2510) Proximity search is not symmetric

Proximity search is not symmetric
---------------------------------

                 Key: SOLR-2510
                 URL: https://issues.apache.org/jira/browse/SOLR-2510
             Project: Solr
          Issue Type: Bug
          Components: search, web gui
    Affects Versions: 3.1
         Environment: Ubuntu 10.04
            Reporter: mark risher


The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are less-than N words before and less-than-or-equal-to N words after.

For example, use the following document:
   WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G

Expected result: Both of the following queries should match:
1) "WORD_D WORD_G"~3
2) "WORD_D WORD_A"~3

Actual result: Only #1 matches.



--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-2510) Proximity search is not symmetric

Posted by "mark risher (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

mark risher updated SOLR-2510:
------------------------------

    Description: 
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
   WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G

Expected result: Both of the following queries should match:
1) "WORD_D WORD_G"~3
2) "WORD_D WORD_A"~3

Actual result: Only #1 matches.



  was:
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are less-than N words before and less-than-or-equal-to N words after.

For example, use the following document:
   WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G

Expected result: Both of the following queries should match:
1) "WORD_D WORD_G"~3
2) "WORD_D WORD_A"~3

Actual result: Only #1 matches.




> Proximity search is not symmetric
> ---------------------------------
>
>                 Key: SOLR-2510
>                 URL: https://issues.apache.org/jira/browse/SOLR-2510
>             Project: Solr
>          Issue Type: Bug
>          Components: search, web gui
>    Affects Versions: 3.1
>         Environment: Ubuntu 10.04
>            Reporter: mark risher
>
> The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.
> For example, use the following document:
>    WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G
> Expected result: Both of the following queries should match:
> 1) "WORD_D WORD_G"~3
> 2) "WORD_D WORD_A"~3
> Actual result: Only #1 matches.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-2510) Proximity search is not symmetric

Posted by "mark risher (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

mark risher updated SOLR-2510:
------------------------------

    Description: 
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
   {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}

__Expected result:__ Both of the following queries should match:
1) {{"WORD_D WORD_G"~3}}
2) {{"WORD_D WORD_A"~3}}

__Actual result:__ Only #1 matches.



  was:
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
{{   WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}

__Expected result:__ Both of the following queries should match:
1) {{"WORD_D WORD_G"~3}}
2) {{"WORD_D WORD_A"~3}}

__Actual result:__ Only #1 matches.




> Proximity search is not symmetric
> ---------------------------------
>
>                 Key: SOLR-2510
>                 URL: https://issues.apache.org/jira/browse/SOLR-2510
>             Project: Solr
>          Issue Type: Bug
>          Components: search, web gui
>    Affects Versions: 3.1
>         Environment: Ubuntu 10.04
>            Reporter: mark risher
>
> The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.
> For example, use the following document:
>    {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}
> __Expected result:__ Both of the following queries should match:
> 1) {{"WORD_D WORD_G"~3}}
> 2) {{"WORD_D WORD_A"~3}}
> __Actual result:__ Only #1 matches.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Resolved] (SOLR-2510) Proximity search is not symmetric

Posted by "Hoss Man (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hoss Man resolved SOLR-2510.
----------------------------

    Resolution: Not A Problem

This is the expected behavior for Phrase queries.

"slop" is specified as an edit distance...
http://lucene.apache.org/java/2_9_1/api/core/org/apache/lucene/search/PhraseQuery.html#setSlop%28int%29

These two queries are not equivalent...

{noformat}
  "WORD_D WORD_G"~3
  "WORD_G WORD_D"~3
{noformat}

the order of the terms as specified in the PhrasQuery matters for determining the edit distance.

> Proximity search is not symmetric
> ---------------------------------
>
>                 Key: SOLR-2510
>                 URL: https://issues.apache.org/jira/browse/SOLR-2510
>             Project: Solr
>          Issue Type: Bug
>          Components: search, web gui
>    Affects Versions: 3.1
>         Environment: Ubuntu 10.04
>            Reporter: mark risher
>
> The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.
> For example, use the following document:
>    {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}
> *Expected result:* Both of the following queries should match:
> 1) {{"WORD_D WORD_G"~3}}
> 2) {{"WORD_G WORD_D"~3}}
> *Actual result:* Only #1 matches. For some reason, it thinks the distance from D to G is 3, but from G to D is 4.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-2510) Proximity search is not symmetric

Posted by "mark risher (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

mark risher updated SOLR-2510:
------------------------------

    Description: 
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
{{   WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}

__Expected result:__ Both of the following queries should match:
1) {{"WORD_D WORD_G"~3}}
2) {{"WORD_D WORD_A"~3}}

__Actual result:__ Only #1 matches.



  was:
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
   WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G

Expected result: Both of the following queries should match:
1) "WORD_D WORD_G"~3
2) "WORD_D WORD_A"~3

Actual result: Only #1 matches.




> Proximity search is not symmetric
> ---------------------------------
>
>                 Key: SOLR-2510
>                 URL: https://issues.apache.org/jira/browse/SOLR-2510
>             Project: Solr
>          Issue Type: Bug
>          Components: search, web gui
>    Affects Versions: 3.1
>         Environment: Ubuntu 10.04
>            Reporter: mark risher
>
> The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.
> For example, use the following document:
> {{   WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}
> __Expected result:__ Both of the following queries should match:
> 1) {{"WORD_D WORD_G"~3}}
> 2) {{"WORD_D WORD_A"~3}}
> __Actual result:__ Only #1 matches.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-2510) Proximity search is not symmetric

Posted by "mark risher (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

mark risher updated SOLR-2510:
------------------------------

    Description: 
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
   {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}

*Expected result:* Both of the following queries should match:
1) {{"WORD_D WORD_G"~3}}
2) {{"WORD_G WORD_D"~3}}

*Actual result:* Only #1 matches. For some reason, it thinks the distance from D to G is 3, but from G to D is 4.



  was:
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
   {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}

*Expected result:* Both of the following queries should match:
1) {{"WORD_D WORD_G"~3}}
2) {{"WORD_D WORD_A"~3}}

*Actual result:* Only #1 matches.




> Proximity search is not symmetric
> ---------------------------------
>
>                 Key: SOLR-2510
>                 URL: https://issues.apache.org/jira/browse/SOLR-2510
>             Project: Solr
>          Issue Type: Bug
>          Components: search, web gui
>    Affects Versions: 3.1
>         Environment: Ubuntu 10.04
>            Reporter: mark risher
>
> The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.
> For example, use the following document:
>    {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}
> *Expected result:* Both of the following queries should match:
> 1) {{"WORD_D WORD_G"~3}}
> 2) {{"WORD_G WORD_D"~3}}
> *Actual result:* Only #1 matches. For some reason, it thinks the distance from D to G is 3, but from G to D is 4.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-2510) Proximity search is not symmetric

Posted by "mark risher (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

mark risher updated SOLR-2510:
------------------------------

    Description: 
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
   {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}

*Expected result:* Both of the following queries should match:
1) {{"WORD_D WORD_G"~3}}
2) {{"WORD_D WORD_A"~3}}

*Actual result:* Only #1 matches.



  was:
The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.

For example, use the following document:
   {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}

__Expected result:__ Both of the following queries should match:
1) {{"WORD_D WORD_G"~3}}
2) {{"WORD_D WORD_A"~3}}

__Actual result:__ Only #1 matches.




> Proximity search is not symmetric
> ---------------------------------
>
>                 Key: SOLR-2510
>                 URL: https://issues.apache.org/jira/browse/SOLR-2510
>             Project: Solr
>          Issue Type: Bug
>          Components: search, web gui
>    Affects Versions: 3.1
>         Environment: Ubuntu 10.04
>            Reporter: mark risher
>
> The proximity search is incorrect on words occurring *before* the matching term. It matches documents that are _less-than_ N words before and _less-than-or-equal-to_ N words after.
> For example, use the following document:
>    {{WORD_A WORD_B WORD_C WORD_D WORD_E WORD_F WORD_G}}
> *Expected result:* Both of the following queries should match:
> 1) {{"WORD_D WORD_G"~3}}
> 2) {{"WORD_D WORD_A"~3}}
> *Actual result:* Only #1 matches.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org