You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Weston Pace (Jira)" <ji...@apache.org> on 2021/09/03 03:16:00 UTC

[jira] [Updated] (ARROW-13876) [C++] Uniform null handling in compute functions

     [ https://issues.apache.org/jira/browse/ARROW-13876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Weston Pace updated ARROW-13876:
--------------------------------
    Description: 
The compute functions today have mixed support for null types.

Unary arithmetic functions (e.g. abs) don't support null arrays

Binary arithmetic functions (e.g. add) support one null array (e.g. int32 + null) but not both null arrays (i.e. null + null) but they do support both values being null (e.g. [null] + [null] = [null] if dtype=int32 but not supported if dtype=null)

sort_indices should support null arrays.

Some functions do forward null arrays:
 - unique

Some functions output a non-null type given null inputs

- is_null (=> boolean)
- is_valid (=> boolean)
- value_counts (=> struct)
- dictionary_encode (=> dictionary<null>)
- count (=> int64)


Some functions throw an error other than "not implemented"

 - list_parent_indices

  was:
The compute functions today have mixed support for null types.

Unary arithmetic functions (e.g. abs) don't support null arrays

Binary arithmetic functions (e.g. add) support one null array (e.g. int32 + null) but not both null arrays (i.e. null + null) but they do support both values being null (e.g. [null] + [null] = [null] if dtype=int32 but not supported if dtype=null)

Some functions do forward null arrays:
 - unique

Some functions output a non-null type given null inputs

- is_null (=> boolean)
- is_valid (=> boolean)
- value_counts (=> struct)
- dictionary_encode (=> dictionary<null>)
- count (=> int64)


Some functions throw an error other than "not implemented"

 - list_parent_indices


> [C++] Uniform null handling in compute functions
> ------------------------------------------------
>
>                 Key: ARROW-13876
>                 URL: https://issues.apache.org/jira/browse/ARROW-13876
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>            Reporter: Weston Pace
>            Priority: Major
>
> The compute functions today have mixed support for null types.
> Unary arithmetic functions (e.g. abs) don't support null arrays
> Binary arithmetic functions (e.g. add) support one null array (e.g. int32 + null) but not both null arrays (i.e. null + null) but they do support both values being null (e.g. [null] + [null] = [null] if dtype=int32 but not supported if dtype=null)
> sort_indices should support null arrays.
> Some functions do forward null arrays:
>  - unique
> Some functions output a non-null type given null inputs
> - is_null (=> boolean)
> - is_valid (=> boolean)
> - value_counts (=> struct)
> - dictionary_encode (=> dictionary<null>)
> - count (=> int64)
> Some functions throw an error other than "not implemented"
>  - list_parent_indices



--
This message was sent by Atlassian Jira
(v8.3.4#803005)