You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "James Dyer (JIRA)" <ji...@apache.org> on 2010/08/23 22:04:16 UTC

[jira] Created: (SOLR-2083) Problem with Distributed SpellCheck

Problem with Distributed SpellCheck
-----------------------------------

                 Key: SOLR-2083
                 URL: https://issues.apache.org/jira/browse/SOLR-2083
             Project: Solr
          Issue Type: Bug
          Components: spellchecker
            Reporter: James Dyer
            Priority: Minor


In DistributedSpellCheckTest, if I add 10 additional documents to the index with field "lowerfilt" containing "The quack red fox jumped over the lazy brown dogs.", then the shard'ed SpellCheckComponent wants to correct "quick" to "quack".  The control, non-shared'ed component correctly does not try to correct "quick".  The test subsequently fails.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Resolved: (SOLR-2083) Problem with Distributed SpellCheck

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll resolved SOLR-2083.
-----------------------------------

    Fix Version/s: 3.1
                   4.0
       Resolution: Fixed

Committed.  Thanks James!

> Problem with Distributed SpellCheck
> -----------------------------------
>
>                 Key: SOLR-2083
>                 URL: https://issues.apache.org/jira/browse/SOLR-2083
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>            Reporter: James Dyer
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: SOLR-2083.patch, SOLR-2083.patch
>
>
> In DistributedSpellCheckTest, if I add 10 additional documents to the index with field "lowerfilt" containing "The quack red fox jumped over the lazy brown dogs.", then the shard'ed SpellCheckComponent wants to correct "quick" to "quack".  The control, non-shared'ed component correctly does not try to correct "quick".  The test subsequently fails.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Assigned: (SOLR-2083) Problem with Distributed SpellCheck

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll reassigned SOLR-2083:
-------------------------------------

    Assignee: Grant Ingersoll

> Problem with Distributed SpellCheck
> -----------------------------------
>
>                 Key: SOLR-2083
>                 URL: https://issues.apache.org/jira/browse/SOLR-2083
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>            Reporter: James Dyer
>            Assignee: Grant Ingersoll
>            Priority: Minor
>         Attachments: SOLR-2083.patch, SOLR-2083.patch
>
>
> In DistributedSpellCheckTest, if I add 10 additional documents to the index with field "lowerfilt" containing "The quack red fox jumped over the lazy brown dogs.", then the shard'ed SpellCheckComponent wants to correct "quick" to "quack".  The control, non-shared'ed component correctly does not try to correct "quick".  The test subsequently fails.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Updated: (SOLR-2083) Problem with Distributed SpellCheck

Posted by "James Dyer (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

James Dyer updated SOLR-2083:
-----------------------------

    Attachment: SOLR-2083.patch

I think there are 2 related problems here:

1. DistributedSpellCheckComponentTest.java runs the test 4 times:  first with 1 shard, then with 2 shards, etc.  In between iterations, it does not clear the Jetty data directories so the first shard from the 2-shard iteration has all the data from the 1-shard iteration, etc.  I can work around this by adding "del("*:*");" as the first line in doTest().  Unfortunately doing this makes the test fail.  I think the problem with the Tester is masking a failing test.

2. The Component ought to report a word as misspelled if *ALL* of the shards report it as not in the dictionary.  However, the current implementation returns a word as misspelled if *ANY* shard reports it as not in the dictionary.

This second patch version resolves the second issue.  The problem with the test may warrant its own issue. (I used the workaround here).

> Problem with Distributed SpellCheck
> -----------------------------------
>
>                 Key: SOLR-2083
>                 URL: https://issues.apache.org/jira/browse/SOLR-2083
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>            Reporter: James Dyer
>            Priority: Minor
>         Attachments: SOLR-2083.patch, SOLR-2083.patch
>
>
> In DistributedSpellCheckTest, if I add 10 additional documents to the index with field "lowerfilt" containing "The quack red fox jumped over the lazy brown dogs.", then the shard'ed SpellCheckComponent wants to correct "quick" to "quack".  The control, non-shared'ed component correctly does not try to correct "quick".  The test subsequently fails.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Updated: (SOLR-2083) Problem with Distributed SpellCheck

Posted by "James Dyer (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

James Dyer updated SOLR-2083:
-----------------------------

    Attachment: SOLR-2083.patch

This patch demos the problem.

> Problem with Distributed SpellCheck
> -----------------------------------
>
>                 Key: SOLR-2083
>                 URL: https://issues.apache.org/jira/browse/SOLR-2083
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>            Reporter: James Dyer
>            Priority: Minor
>         Attachments: SOLR-2083.patch
>
>
> In DistributedSpellCheckTest, if I add 10 additional documents to the index with field "lowerfilt" containing "The quack red fox jumped over the lazy brown dogs.", then the shard'ed SpellCheckComponent wants to correct "quick" to "quack".  The control, non-shared'ed component correctly does not try to correct "quick".  The test subsequently fails.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org