You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Rok Mihevc (Jira)" <ji...@apache.org> on 2021/04/08 16:03:00 UTC
[jira] [Created] (ARROW-12301) [C++][Compute] Use generic
hash-aggregate for DictionaryArrays
Rok Mihevc created ARROW-12301:
----------------------------------
Summary: [C++][Compute] Use generic hash-aggregate for DictionaryArrays
Key: ARROW-12301
URL: https://issues.apache.org/jira/browse/ARROW-12301
Project: Apache Arrow
Issue Type: Improvement
Reporter: Rok Mihevc
When calculating unique for chunked DictionaryArrays we currently run through all chunks and unify their dictionaries and then collect chunk indices. We could avoid the dictionary unification by using a generic hash.
[See discussion here.|https://github.com/apache/arrow/pull/9683]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)