You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "James Dyer (JIRA)" <ji...@apache.org> on 2013/05/01 18:48:15 UTC
[jira] [Updated] (SOLR-3240) add spellcheck 'approximate collation
count' mode
[ https://issues.apache.org/jira/browse/SOLR-3240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Dyer updated SOLR-3240:
-----------------------------
Attachment: SOLR-3240.patch
Here is an updated patch for Trunk which I plan to commit soon.
Note that this patch's EarlyTerminatingCollector is similar to the recently-added (LUCENE-4858) EarlyTerminatingSortingCollector. However, there seems to be enough differences that I did not attempt to combine the two. I have E.T.C. in a ".solr" package, but possibly this showuld be on a ".lucene" package instead?
Any review or comments are appreciated.
> add spellcheck 'approximate collation count' mode
> -------------------------------------------------
>
> Key: SOLR-3240
> URL: https://issues.apache.org/jira/browse/SOLR-3240
> Project: Solr
> Issue Type: Improvement
> Components: spellchecker
> Reporter: Robert Muir
> Attachments: SOLR-3240.patch, SOLR-3240.patch, SOLR-3240.patch
>
>
> SpellCheck's Collation in Solr is a way to ensure spellcheck/suggestions
> will actually net results (taking into account context like filtering).
> In order to do this (from my understanding), it generates candidate queries,
> executes them, and saves the total hit count: collation.setHits(hits).
> For a large index it seems this might be doing too much work: in particular
> I'm interested in ensuring this feature can work fast enough/well for autosuggesters.
> So I think we should offer an 'approximate' mode that uses an early-terminating
> Collector, collect()ing only N docs (e.g. n=1), and we approximate this result
> count based on docid space.
> I'm not sure what needs to happen on the solr side (possibly support for custom collectors?),
> but I think this could help and should possibly be the default.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org