Posted to dev@lucene.apache.org by "Yonik Seeley (JIRA)" <ji...@apache.org> on 2017/12/30 02:48:00 UTC

[jira] [Created] (SOLR-11806) Use indirection for hash-based faceting

Yonik Seeley created SOLR-11806:
-----------------------------------

             Summary: Use indirection for hash-based faceting
                 Key: SOLR-11806
                 URL: https://issues.apache.org/jira/browse/SOLR-11806
             Project: Solr
          Issue Type: Improvement
      Security Level: Public (Default Security Level. Issues are Public)
          Components: Facet Module
            Reporter: Yonik Seeley


Currently, when hashing is used during faceting (FacetFieldProcessorByHashDV), every SlotAcc instance acts as a hash table.  Each one is accessed directly by hash, and when the hash table needs to be resized, every SlotAcc instance must be rehashed.  This works well when a single statistic is being computed, but it is wasteful when multiple statistics are calculated per bucket.

An alternative would be for the hash table to map each bucket to a persistent slot (starting at 0 and incremented for each new bucket encountered).  All of the SlotAcc instances would then be dense and indexed by the persistent slot, so a resize would only need to rehash the one table rather than every accumulator.  This would also allow more efficient implementations of some metrics that use hashes themselves (made easier because slots no longer change and can thus be part of a hash key).
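
To make the proposal concrete, here is a minimal, self-contained sketch of the indirection idea. It is not Solr code: the class SlotIndirectionSketch, its method names, the open-addressing layout, and the two plain arrays standing in for SlotAcc instances (a count and a sum) are all assumptions made for illustration. The point it tries to show is that only the key-to-slot table is rehashed on resize, while the dense accumulators merely grow and keep their slot numbers.

```java
import java.util.Arrays;

/** Hypothetical illustration only; not the actual FacetFieldProcessorByHashDV. */
class SlotIndirectionSketch {
    // Open-addressing table: tableKeys[i] holds a bucket value and
    // tableSlots[i] holds its persistent slot (-1 marks an empty entry).
    private long[] tableKeys = new long[8];
    private int[] tableSlots = emptySlots(8);
    private int numSlots = 0;

    // Dense accumulators indexed by persistent slot (stand-ins for SlotAcc).
    private long[] counts = new long[8];
    private double[] sums = new double[8];

    private static int[] emptySlots(int n) {
        int[] a = new int[n];
        Arrays.fill(a, -1);
        return a;
    }

    private static int hash(long key, int mask) {
        long h = key * 0x9E3779B97F4A7C15L; // simple multiplicative mix
        return (int) (h >>> 32) & mask;
    }

    /** Returns the persistent slot for key, assigning the next one if key is new. */
    int slotFor(long key) {
        int mask = tableKeys.length - 1;
        int i = hash(key, mask);
        while (tableSlots[i] != -1) {
            if (tableKeys[i] == key) return tableSlots[i];
            i = (i + 1) & mask; // linear probing
        }
        int slot = numSlots++;
        tableKeys[i] = key;
        tableSlots[i] = slot;
        if (slot >= counts.length) {
            // Dense accumulators just grow; existing slots keep their index.
            counts = Arrays.copyOf(counts, counts.length * 2);
            sums = Arrays.copyOf(sums, sums.length * 2);
        }
        if (numSlots * 2 > tableKeys.length) rehash();
        return slot;
    }

    /** Only the key-to-slot table is rehashed; accumulators are untouched. */
    private void rehash() {
        long[] oldKeys = tableKeys;
        int[] oldSlots = tableSlots;
        int newLen = oldKeys.length * 2;
        tableKeys = new long[newLen];
        tableSlots = emptySlots(newLen);
        int mask = newLen - 1;
        for (int j = 0; j < oldKeys.length; j++) {
            if (oldSlots[j] == -1) continue;
            int i = hash(oldKeys[j], mask);
            while (tableSlots[i] != -1) i = (i + 1) & mask;
            tableKeys[i] = oldKeys[j];
            tableSlots[i] = oldSlots[j];
        }
    }

    /** Accumulates one value into the bucket identified by key. */
    void collect(long key, double value) {
        int slot = slotFor(key);
        counts[slot]++;
        sums[slot] += value;
    }

    int numBuckets() { return numSlots; }
    long countAt(int slot) { return counts[slot]; }
    double sumAt(int slot) { return sums[slot]; }

    public static void main(String[] args) {
        SlotIndirectionSketch facet = new SlotIndirectionSketch();
        facet.collect(10L, 2.0);
        facet.collect(20L, 3.0);
        facet.collect(10L, 4.0);
        int slot = facet.slotFor(10L);
        System.out.println("buckets=" + facet.numBuckets()
            + " count=" + facet.countAt(slot) + " sum=" + facet.sumAt(slot));
    }
}
```

Because a slot never changes once assigned, a metric that needs its own hash structure (for example, a set of distinct values per bucket) could in principle key it on the slot number together with the value, which is the second benefit the description mentions.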





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org