Posted to dev@lucene.apache.org by "James Dyer (JIRA)" <ji...@apache.org> on 2014/12/11 15:47:13 UTC

[jira] [Commented] (SOLR-6817) Collation suggestions regard correctly spelt words as incorrect.

    [ https://issues.apache.org/jira/browse/SOLR-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242626#comment-14242626 ] 

James Dyer commented on SOLR-6817:
----------------------------------

I'm not convinced this is a bug, so before we continue down this route, can you post this as a question on the Solr users' mailing list?

Include:
- Your spelling config from solrconfig.xml
- The request URLs for the two queries you are trying
- The full spelling response for each query

> Collation suggestions regard correctly spelt words as incorrect.
> ----------------------------------------------------------------
>
>                 Key: SOLR-6817
>                 URL: https://issues.apache.org/jira/browse/SOLR-6817
>             Project: Solr
>          Issue Type: Bug
>    Affects Versions: 4.10.2
>            Reporter: Ye Yan
>            Priority: Minor
>         Attachments: SpellCheckCollator.java, solr_collation.patch
>
>
> We've run into an issue where, if a query has one word spelt correctly and one word spelt incorrectly, the collation suggestions only return alternatives for each word.  This is not the case if both words are spelt incorrectly.
> For example, if you search for "ipod toach", it will suggest alternatives for both "ipod" and "toach".  The collation suggestions will not include "ipod touch", which would be the desired suggestion.
> If you search for "ipodd toach", the collation suggestions will include "ipod touch", as both words are incorrectly spelt.
> So you need to spell everything wrong, rather than only some things wrong, for the spell check collator to work correctly.  In the real world, people will often misspell only a few words in a query, not all of them, so it is a bit of a usability issue.
> I've created a workaround for the collator that includes the original term in the collation suggestions if the term itself is spelt correctly, rather than using suggestions for the correctly spelt word.
> This has a bit of a performance hit, as we need to run a query to make sure the word is spelt correctly, so I think there might be a deeper way of changing it, but it works as intended for now.
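
The workaround described in the issue can be sketched roughly as follows. This is a toy model written for illustration only; it is not the attached solr_collation.patch and does not use Solr's SpellCheckCollator API. The class name CollationSketch, the methods candidatesFor and collate, and the in-memory dictionary (standing in for the extra "is this word spelt correctly" query mentioned above) are all assumptions.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Illustrative only: a toy model of the reported workaround, not Solr code.
class CollationSketch {

    // Candidate replacements for one query term. If the term is itself in the
    // (hypothetical) dictionary, it is offered first, ahead of its suggestions.
    static List<String> candidatesFor(String term, Set<String> dictionary,
                                      Map<String, List<String>> suggestions) {
        List<String> candidates = new ArrayList<>();
        if (dictionary.contains(term)) {
            candidates.add(term); // keep the correctly spelt original term
        }
        for (String s : suggestions.getOrDefault(term, List.of())) {
            if (!candidates.contains(s)) {
                candidates.add(s);
            }
        }
        if (candidates.isEmpty()) {
            candidates.add(term); // nothing known about the term: pass it through
        }
        return candidates;
    }

    // Builds every combination of per-term candidates, preserving term order.
    static List<String> collate(List<String> terms, Set<String> dictionary,
                                Map<String, List<String>> suggestions) {
        List<String> collations = new ArrayList<>();
        collations.add("");
        for (String term : terms) {
            List<String> next = new ArrayList<>();
            for (String prefix : collations) {
                for (String c : candidatesFor(term, dictionary, suggestions)) {
                    next.add(prefix.isEmpty() ? c : prefix + " " + c);
                }
            }
            collations = next;
        }
        return collations;
    }

    public static void main(String[] args) {
        // Made-up dictionary and suggestions, chosen to mirror the example query.
        Set<String> dictionary = Set.of("ipod", "touch", "pod");
        Map<String, List<String>> suggestions = Map.of(
                "ipod", List.of("pod"),
                "toach", List.of("touch"));
        System.out.println(collate(List.of("ipod", "toach"), dictionary, suggestions));
    }
}
```

With the made-up data in main, the candidates for "ipod" are the original term plus its suggestion, so "ipod touch" appears among the collations instead of only combinations built from alternatives to "ipod".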


