Posted to dev@lucene.apache.org by "ASF subversion and git services (JIRA)" <ji...@apache.org> on 2017/11/01 17:00:00 UTC
[jira] [Commented] (LUCENE-7994) Use int/int hash map for int taxonomy facet counts
[ https://issues.apache.org/jira/browse/LUCENE-7994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16234381#comment-16234381 ]
ASF subversion and git services commented on LUCENE-7994:
---------------------------------------------------------
Commit 77e6e291bf34ffaa6f1afc2d9c64779f4b250b65 in lucene-solr's branch refs/heads/master from Mike McCandless
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=77e6e29 ]
LUCENE-7994: use int/int scatter map to count facets when number of hits is small relative to number of unique facet labels
> Use int/int hash map for int taxonomy facet counts
> --------------------------------------------------
>
> Key: LUCENE-7994
> URL: https://issues.apache.org/jira/browse/LUCENE-7994
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Priority: Major
> Fix For: master (8.0), 7.2
>
> Attachments: LUCENE-7994.patch, LUCENE-7994.patch
>
>
> Int taxonomy facets today always count into a dense {{int[]}}, which is wasteful in cases where the number of unique facet labels is high and the size of the current result set is small.
> I factored out the native hash map from LUCENE-7927 and use a simple heuristic (which users can customize by subclassing) to decide up front whether to count sparsely or densely. I also made loading of the large children and siblings {{int[]}} lazy, so that they are only instantiated if you really need them.
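
The sparse-versus-dense decision described above can be sketched roughly as follows. This is only an illustration, not the code from the commit: the class name FacetCounter, the method useSparse, and the thresholds (1024 labels, hits below 10% of labels) are all assumptions made for the example, and java.util.HashMap stands in for the primitive int/int map the commit actually uses.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of choosing sparse or dense facet counting up front. */
public class FacetCounter {
    // Exactly one of these is non-null, chosen once in the constructor.
    private final int[] dense;
    private final Map<Integer, Integer> sparse;

    public FacetCounter(int numLabels, int expectedHits) {
        if (useSparse(numLabels, expectedHits)) {
            dense = null;
            sparse = new HashMap<>(); // stand-in for a primitive int/int hash map
        } else {
            dense = new int[numLabels];
            sparse = null;
        }
    }

    /**
     * Assumed heuristic: count sparsely only when there are many labels and
     * few hits relative to them. Subclasses may override; it depends only on
     * its arguments, so calling it from the constructor is safe here.
     */
    protected boolean useSparse(int numLabels, int expectedHits) {
        return numLabels >= 1024 && expectedHits < numLabels / 10;
    }

    /** Adds one to the count for the given label ordinal. */
    public void increment(int ordinal) {
        if (sparse != null) {
            sparse.merge(ordinal, 1, Integer::sum);
        } else {
            dense[ordinal]++;
        }
    }

    /** Returns the count for the given label ordinal, or 0 if never counted. */
    public int get(int ordinal) {
        return sparse != null ? sparse.getOrDefault(ordinal, 0) : dense[ordinal];
    }

    public boolean isSparse() {
        return sparse != null;
    }
}
```

With these assumed thresholds, a query matching 50 documents against a taxonomy of one million labels would allocate only a small map instead of a one-million-entry array, while a query matching half the index would still count into the dense array.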