You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2006/06/30 22:20:30 UTC
[jira] Commented: (LUCENE-285) David Spencer Spell Checker improved
[ http://issues.apache.org/jira/browse/LUCENE-285?page=comments#action_12418705 ]
Otis Gospodnetic commented on LUCENE-285:
-----------------------------------------
Hi Cédrik. Yes, please open a new issue and attach the patch if you have it. It looks like this would create a commons-lang dependency, in which case the build script for the spell checker might need a small tweak, too.
> David Spencer Spell Checker improved
> ------------------------------------
>
> Key: LUCENE-285
> URL: http://issues.apache.org/jira/browse/LUCENE-285
> Project: Lucene - Java
> Type: Improvement
> Components: Search
> Versions: unspecified
> Environment: Operating System: other
> Platform: All
> Reporter: Nicolas Maisonneuve
> Priority: Minor
> Attachments: spellchecker.zip
>
> hy,
> i developed a SpellChecker based on the David Spencer code (DSc) but more flexible.
> the structure of the index is inspired of the DSc (for a 3-4 gram):
> word:
> gram3:
> gram4:
>
> 3start:
> 4start:
> ..
> 3end:
> 4end:
> ..
> transposition:
>
> This index is a dictonary so there isn't the "freq" field like with DSc version.
> it's independant of the user index. So we can add words becoming to several
> fields of several index for example or, why not, to a file with a list of words.
> The suggestSimilar method return a list of suggests word sorted by the
> Levenshtein distance and optionaly to the popularity of the word for a specific
> field in a user index. More of that, this list can be restricted only to words
> present in a specific field of a user index.
>
> See the test case.
>
> i hope this code will be put in the lucene sandbox.
>
> Nicolas Maisonneuve
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org