You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Michael McCandless (JIRA)" <ji...@apache.org> on 2013/01/18 16:14:13 UTC
[jira] [Commented] (LUCENE-4600) Explore facets aggregation during
documents collection
[ https://issues.apache.org/jira/browse/LUCENE-4600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13557263#comment-13557263 ]
Michael McCandless commented on LUCENE-4600:
--------------------------------------------
Patch looks great: +1
And this is a healthy speedup, on the Wikipedia 1M / 25 ords per doc test:
{noformat}
Task QPS base StdDev QPS comp StdDev Pct diff
PKLookup 239.18 (1.5%) 238.87 (1.1%) -0.1% ( -2% - 2%)
LowTerm 98.99 (3.1%) 135.95 (1.8%) 37.3% ( 31% - 43%)
HighTerm 20.95 (1.2%) 29.08 (2.4%) 38.8% ( 34% - 42%)
MedTerm 34.55 (1.5%) 48.31 (2.0%) 39.8% ( 35% - 43%)
{noformat}
> Explore facets aggregation during documents collection
> ------------------------------------------------------
>
> Key: LUCENE-4600
> URL: https://issues.apache.org/jira/browse/LUCENE-4600
> Project: Lucene - Core
> Issue Type: Improvement
> Components: modules/facet
> Reporter: Michael McCandless
> Attachments: LUCENE-4600-cli.patch, LUCENE-4600.patch, LUCENE-4600.patch, LUCENE-4600.patch
>
>
> Today the facet module simply gathers all hits (as a bitset, optionally with a float[] to hold scores as well, if you will aggregate them) during collection, and then at the end when you call getFacetsResults(), it makes a 2nd pass over all those hits doing the actual aggregation.
> We should investigate just aggregating as we collect instead, so we don't have to tie up transient RAM (fairly small for the bit set but possibly big for the float[]).
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org