You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "James Dyer (JIRA)" <ji...@apache.org> on 2010/07/23 17:58:53 UTC
[jira] Commented: (SOLR-732) Collation bug
[ https://issues.apache.org/jira/browse/SOLR-732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891646#action_12891646 ]
James Dyer commented on SOLR-732:
---------------------------------
I do not think this is a bug. Suggestions are ordered by score (ie. Levenstein Distance) rather than # hits. See org.apache.lucene.search.spell.SuggestWord.compareTo() . The "score" variable is set in org.apache.lucene.search.spell.SpellChecker.suggestSimilar.
In working with the spellchecker, if setting spellcheck.count to a high value (like 100), I've often gotten results far down the list with a lot more hits than the ones early in the list but the word is obviously a less-likely correction than the ones higher up.
Perhaps this old ticket can be closed?
> Collation bug
> -------------
>
> Key: SOLR-732
> URL: https://issues.apache.org/jira/browse/SOLR-732
> Project: Solr
> Issue Type: Bug
> Components: spellchecker
> Affects Versions: 1.3
> Reporter: Matthew Runo
> Priority: Minor
>
> Search term: Quicksilver... I get two suggestions...
> <lst name="suggestion">
> <int name="frequency">2</int>
> <str name="word">Quicksilver</str>
> </lst>
> <lst name="suggestion">
> <int name="frequency">220</int>
> <str name="word">Quiksilver</str>
> </lst>
> ...and it's not correctly spelled...
> <bool name="correctlySpelled">false</bool>
> ...but the collation is of the first term - not the one with the highest frequency?
> <str name="collation">Quicksilver</str>
> Other collations, for example, 'runnning' come up with more than one suggestion (cunning, running) but properly pick the 'best bet' based on frequency.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org