You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Weston Pace (Jira)" <ji...@apache.org> on 2021/09/03 01:17:00 UTC

[jira] [Created] (ARROW-13876) [C++] Uniform null handling in compute functions

Weston Pace created ARROW-13876:
-----------------------------------

             Summary: [C++] Uniform null handling in compute functions
                 Key: ARROW-13876
                 URL: https://issues.apache.org/jira/browse/ARROW-13876
             Project: Apache Arrow
          Issue Type: Bug
          Components: C++
            Reporter: Weston Pace


The compute functions today have mixed support for null types.

Unary arithmetic functions (e.g. abs) don't support null arrays

Binary arithmetic functions (e.g. add) support one null array (e.g. int32 + null) but not both null arrays (i.e. null + null) but they do support both values being null (e.g. [null] + [null] = [null] if dtype=int32 but not supported if dtype=null)

Some functions do forward null arrays:
 - unique

Some functions output a non-null type given null inputs

- is_null (=> boolean)
- is_valid (=> boolean)
- value_counts (=> struct)
- dictionary_encode (=> dictionary<null>)
- count (=> int64)


Some functions throw an error other than "not implemented"

 - list_parent_indices



--
This message was sent by Atlassian Jira
(v8.3.4#803005)