Posted to solr-user@lucene.apache.org by alexpusch <al...@getjaco.com> on 2017/12/12 09:39:08 UTC

Performance issues with 'unique' function in json facets over a high cardinality field

Hi,
I have a surprising performance issue with the 'unique' function in a JSON
facet.

My setup holds a large number of docs (~1B). Despite this, I only facet on
the small result set of a query, just a few docs. The query itself returns
as fast as expected, but when I try to do a unique count on one of the
fields using json.facet, the query takes much longer.
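
For illustration, a top-level unique count of this kind could be requested
roughly as sketched below. This is not the original query: the query string
"session_id:abc123", the field name "user_id", and the helper name are
hypothetical placeholders. The sketch only builds the request parameters and
does not contact Solr.

```python
import json
from urllib.parse import urlencode


def unique_count_params(query, field):
    """Build select parameters that ask for a unique count of `field`
    over the documents matching `query` (hypothetical helper)."""
    # JSON Facet API: a statistic facet is written as "unique(<field>)".
    facet = {"unique_values": f"unique({field})"}
    return {
        "q": query,
        "rows": 0,  # only the facet result is needed, not the documents
        "json.facet": json.dumps(facet),
    }


# Hypothetical query and field name, used only as placeholders.
params = unique_count_params("session_id:abc123", "user_id")
query_string = urlencode(params)
```

The resulting query string would be appended to the collection's select
handler URL, which depends on the deployment.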

Facet time remains constant when I try to do it over a much larger set of
docs. This leads me to believe that this unique count actually depends on
overall field cardinality and not the cardinality in the result set. Am I
right?

This phenomenon occurs both in a top-level facet and in a sub-facet
calculation, which is what I am actually interested in.
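
The sub-facet case could look roughly like the sketch below: a terms facet
whose buckets each carry a unique count. The field names "page" and
"user_id", the facet labels, and the helper name are assumptions made for
illustration, not the real schema or the original query.

```python
import json


def terms_with_unique_subfacet(bucket_field, unique_field, limit=10):
    """Build a terms facet on `bucket_field` where every bucket also
    reports unique(`unique_field`) (hypothetical helper)."""
    return {
        "by_bucket": {
            "type": "terms",
            "field": bucket_field,
            "limit": limit,
            # Sub-facet: computed once per bucket of the parent terms facet.
            "facet": {"unique_values": f"unique({unique_field})"},
        }
    }


# Hypothetical field names, used only as placeholders.
sub_facet = terms_with_unique_subfacet("page", "user_id", limit=5)
sub_facet_params = {"q": "*:*", "rows": 0, "json.facet": json.dumps(sub_facet)}
```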

Is there a way to facet, and sub-facet over a field with overall high
cardinality, but small cardinality in the result set?

My setup is Solr 6.0 in a DataStax Enterprise cluster.

example queries:





Thanks,
Alex



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html