Posted to jira@arrow.apache.org by "David Li (Jira)" <ji...@apache.org> on 2021/09/27 17:52:00 UTC

[jira] [Assigned] (ARROW-13876) [C++] Uniform null handling in compute functions

     [ https://issues.apache.org/jira/browse/ARROW-13876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

David Li reassigned ARROW-13876:
--------------------------------

    Assignee: David Li

> [C++] Uniform null handling in compute functions
> ------------------------------------------------
>
>                 Key: ARROW-13876
>                 URL: https://issues.apache.org/jira/browse/ARROW-13876
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>            Reporter: Weston Pace
>            Assignee: David Li
>            Priority: Major
>              Labels: kernel, types
>             Fix For: 6.0.0
>
>
> The compute functions today have mixed support for the null type (dtype=null).
> Unary arithmetic functions (e.g. abs) don't support null arrays.
> Binary arithmetic functions (e.g. add) support one null array (e.g. int32 + null) but not two null arrays (i.e. null + null). They do support both values being null when the type is not null (e.g. [null] + [null] = [null] works if dtype=int32 but is not supported if dtype=null).
> sort_indices should support null arrays.
> Some functions do forward null arrays:
> - unique
> Some functions output a non-null type given null inputs:
> - is_null (=> boolean)
> - is_valid (=> boolean)
> - value_counts (=> struct)
> - dictionary_encode (=> dictionary<null>)
> - count (=> int64)
> Some functions throw an error other than "not implemented":
> - list_parent_indices
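
The description above separates two cases that are easy to mix up: an array whose type is null (dtype=null) and a typed array (e.g. int32) whose slots happen to be null. The following is a toy model in plain Python, not Arrow code and not the fix that was implemented. The Array class and the add function are hypothetical names made up for illustration, and the choice to return a null-typed result for null + null is an assumption, since the issue does not state the intended output type.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Array:
    """Hypothetical stand-in for a typed array; not an Arrow class."""
    dtype: str                    # e.g. "int32", or "null" for the null type
    values: List[Optional[int]]   # None marks a null slot


def add(left: Array, right: Array) -> Array:
    """Element-wise add that treats every combination of nulls the same way."""
    if len(left.values) != len(right.values):
        raise ValueError("length mismatch")

    # Pick the output type. A null-typed input defers to the other input.
    # ASSUMPTION: null + null yields a null-typed result.
    if left.dtype == "null":
        out_type = right.dtype
    elif right.dtype == "null" or right.dtype == left.dtype:
        out_type = left.dtype
    else:
        raise TypeError(f"cannot add {left.dtype} and {right.dtype}")

    # A null slot on either side makes the output slot null.
    out = [
        None if a is None or b is None else a + b
        for a, b in zip(left.values, right.values)
    ]
    return Array(out_type, out)


if __name__ == "__main__":
    int_nulls = Array("int32", [None])   # typed array holding a null value
    null_arr = Array("null", [None])     # array of the null type

    print(add(int_nulls, int_nulls))     # [null] + [null] with dtype=int32
    print(add(int_nulls, null_arr))      # int32 + null
    print(add(null_arr, null_arr))       # null + null
```

In this model the three cases from the description (typed nulls, one null-typed input, two null-typed inputs) all go through the same code path, which is the kind of uniformity the issue title asks for.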



--
This message was sent by Atlassian Jira
(v8.3.4#803005)