You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Rok Mihevc (Jira)" <ji...@apache.org> on 2021/04/08 16:03:00 UTC

[jira] [Created] (ARROW-12301) [C++][Compute] Use generic hash-aggregate for DictionaryArrays

Rok Mihevc created ARROW-12301:
----------------------------------

             Summary: [C++][Compute] Use generic hash-aggregate for DictionaryArrays
                 Key: ARROW-12301
                 URL: https://issues.apache.org/jira/browse/ARROW-12301
             Project: Apache Arrow
          Issue Type: Improvement
            Reporter: Rok Mihevc


When calculating unique for chunked DictionaryArrays we currently run through all chunks and unify their dictionaries and then collect chunk indices. We could avoid the dictionary unification by using a generic hash.

[See discussion here.|https://github.com/apache/arrow/pull/9683]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)