Posted to issues@arrow.apache.org by "Kae Suarez (Jira)" <ji...@apache.org> on 2022/08/16 23:11:00 UTC

[jira] [Created] (ARROW-17443) [C++] RecordBatch Make() with Arrow Arrays could infer length

Kae Suarez created ARROW-17443:
----------------------------------

             Summary: [C++] RecordBatch Make() with Arrow Arrays could infer length
                 Key: ARROW-17443
                 URL: https://issues.apache.org/jira/browse/ARROW-17443
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: Kae Suarez


Currently, when you build a Table from ChunkedArrays, the num_rows argument is optional: if it is left at its default, the Table infers the row count from the ChunkedArrays and uses that to initialize itself. If the user wants a subset instead, they can supply this argument.

RecordBatch, when built from Arrays, always requires the number of rows to be supplied, which leads users to pass things like arr->length() when they just want all of their data.

Could RecordBatch's Array-based Make() method be changed to match the behavior of Table's ChunkedArray-based Make() method, if only for the sake of consistency?
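
To make the request concrete, here is a minimal sketch of the inference being asked for. It deliberately does not use the Arrow headers: Column, Batch, and MakeBatch are hypothetical stand-ins for arrow::Array, arrow::RecordBatch, and RecordBatch::Make(), and the -1 sentinel is an assumption chosen for illustration. How Arrow would actually implement this is not something the sketch claims to show.

```cpp
#include <cstdint>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical stand-in for arrow::Array: only the length matters here.
struct Column {
  std::int64_t length;
};

// Hypothetical stand-in for arrow::RecordBatch.
struct Batch {
  std::int64_t num_rows;
  std::vector<std::shared_ptr<Column>> columns;
};

// Sketch of the requested behavior: num_rows defaults to -1, which is
// taken to mean "infer the row count from the columns".
Batch MakeBatch(std::vector<std::shared_ptr<Column>> columns,
                std::int64_t num_rows = -1) {
  if (num_rows < 0) {
    // No columns gives an empty batch; otherwise use the first column's length.
    num_rows = columns.empty() ? 0 : columns.front()->length;
  }
  // An explicit num_rows is stored as given; this sketch performs no
  // validation and no slicing of the columns.
  return Batch{num_rows, std::move(columns)};
}
```

With such a default, a caller who wants all of the data could write MakeBatch({a, b}) rather than MakeBatch({a, b}, a->length), while MakeBatch({a, b}, 2) would still allow an explicit row count.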



--
This message was sent by Atlassian Jira
(v8.20.10#820010)