Posted to issues@arrow.apache.org by "Kae Suarez (Jira)" <ji...@apache.org> on 2022/08/16 23:11:00 UTC
[jira] [Created] (ARROW-17443) [C++] RecordBatch Make() with Arrow Arrays could infer length
Kae Suarez created ARROW-17443:
----------------------------------
Summary: [C++] RecordBatch Make() with Arrow Arrays could infer length
Key: ARROW-17443
URL: https://issues.apache.org/jira/browse/ARROW-17443
Project: Apache Arrow
Issue Type: Improvement
Components: C++
Reporter: Kae Suarez
Currently, when a Table is made from ChunkedArrays, the num_rows argument is optional: if it is left at its default, the Table infers the row count from the ChunkedArrays and uses that to initialize itself. A user who wants a subset can instead fill in this argument.
RecordBatch, when made with Arrays, always requires the number of rows to be supplied, so users end up doing things like passing arr->length() when they just want all their data.
Could RecordBatch's Array-using Make() method be changed to match the behavior of Table's ChunkedArray-using Make() method, if only for the sake of consistency?
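To make the request concrete, here is a minimal sketch of the inference being asked for. It deliberately uses small stand-in types in a hypothetical sketch namespace rather than Arrow's own classes, so the names Array, RecordBatch and MakeRecordBatch below are illustrative assumptions and not the library's API. The assumed convention is that num_rows defaults to -1, meaning "infer from the columns", while an explicit value is stored as given.

```cpp
#include <cassert>
#include <cstdint>
#include <memory>
#include <stdexcept>
#include <utility>
#include <vector>

// Stand-in types for illustration only; these are NOT Arrow's classes.
namespace sketch {

struct Array {
  std::vector<int64_t> values;
  int64_t length() const { return static_cast<int64_t>(values.size()); }
};

struct RecordBatch {
  int64_t num_rows;
  std::vector<std::shared_ptr<Array>> columns;
};

// Hypothetical factory: num_rows = -1 means "infer the length from the columns".
inline std::shared_ptr<RecordBatch> MakeRecordBatch(
    std::vector<std::shared_ptr<Array>> columns, int64_t num_rows = -1) {
  assert(num_rows >= -1);
  if (num_rows < 0) {
    // Infer from the first column; an empty column list yields zero rows.
    num_rows = columns.empty() ? 0 : columns.front()->length();
    // When inferring, every column must agree on the length.
    for (const auto& col : columns) {
      if (col->length() != num_rows) {
        throw std::invalid_argument("columns have differing lengths");
      }
    }
  }
  // An explicit num_rows is stored as given; this sketch does not validate it.
  return std::make_shared<RecordBatch>(RecordBatch{num_rows, std::move(columns)});
}

}  // namespace sketch
```

With a default like this, a caller who wants all of the data could write sketch::MakeRecordBatch({a, b}) and would only pass a row count when something other than the full length is intended.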