You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Simon Willnauer (JIRA)" <ji...@apache.org> on 2013/03/06 18:20:13 UTC

[jira] [Updated] (LUCENE-4813) Allow DirectSpellchecker to use totalTermFrequency rather than docFrequency

     [ https://issues.apache.org/jira/browse/LUCENE-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Simon Willnauer updated LUCENE-4813:
------------------------------------

    Attachment: LUCENE-4813.patch

here is an initial patch that adds this as the default yet optional statistics.
                
> Allow DirectSpellchecker to use totalTermFrequency rather than docFrequency
> ---------------------------------------------------------------------------
>
>                 Key: LUCENE-4813
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4813
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/spellchecker
>    Affects Versions: 4.1
>            Reporter: Simon Willnauer
>             Fix For: 4.2, 5.0
>
>         Attachments: LUCENE-4813.patch
>
>
> we have a bunch of new statistics in on our term dictionaries that we should make use of where it makes sense. For DirectSpellChecker totalTermFreq and sumTotalTermFreq might be better suited for spell correction on top of a fulltext index than docFreq and maxDoc

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org