You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "James Dyer (JIRA)" <ji...@apache.org> on 2010/07/23 17:58:53 UTC

[jira] Commented: (SOLR-732) Collation bug

    [ https://issues.apache.org/jira/browse/SOLR-732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891646#action_12891646 ] 

James Dyer commented on SOLR-732:
---------------------------------

I do not think this is a bug.  Suggestions are ordered by score (ie. Levenstein Distance) rather than # hits.  See org.apache.lucene.search.spell.SuggestWord.compareTo() .  The "score" variable is set in org.apache.lucene.search.spell.SpellChecker.suggestSimilar.

In working with the spellchecker, if setting spellcheck.count to a high value (like 100), I've often gotten results far down the list with a lot more hits than the ones early in the list but the word is obviously a less-likely correction than the ones higher up.

Perhaps this old ticket can be closed?

> Collation bug
> -------------
>
>                 Key: SOLR-732
>                 URL: https://issues.apache.org/jira/browse/SOLR-732
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>    Affects Versions: 1.3
>            Reporter: Matthew Runo
>            Priority: Minor
>
> Search term: Quicksilver... I get two suggestions...
> <lst name="suggestion">
> <int name="frequency">2</int>
> <str name="word">Quicksilver</str>
> </lst>
> <lst name="suggestion">
> <int name="frequency">220</int>
> <str name="word">Quiksilver</str>
> </lst>
> ...and it's not correctly spelled...
> <bool name="correctlySpelled">false</bool>
> ...but the collation is of the first term - not the one with the highest frequency?
> <str name="collation">Quicksilver</str>
> Other collations, for example, 'runnning' come up with more than one suggestion (cunning, running) but properly pick the 'best bet' based on frequency. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org