You are viewing a plain text version of this content. The canonical link for it is here.

Posted to jira@arrow.apache.org by "Neal Richardson (Jira)" <ji...@apache.org> on 2021/02/23 23:17:00 UTC

[jira] [Assigned] (ARROW-10403) [C++] Implement unique kernel for dictionary type

     [ https://issues.apache.org/jira/browse/ARROW-10403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neal Richardson reassigned ARROW-10403:
---------------------------------------

    Assignee: Rok Mihevc  (was: Ben Kietzman)

> [C++] Implement unique kernel for dictionary type
> -------------------------------------------------
>
>                 Key: ARROW-10403
>                 URL: https://issues.apache.org/jira/browse/ARROW-10403
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>            Reporter: Neal Richardson
>            Assignee: Rok Mihevc
>            Priority: Major
>
> Calling the "unique" compute function on a ChunkedArray of dictionary type (as read by the CSV reader) errors with "Only hashing for data with equal dictionaries currently supported". But is it necessary to hash to get unique values from a dictionary type? The dictionary values are the unique values (for each chunk), they're already there.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)