Posted to issues@arrow.apache.org by "Micah Kornfield (Jira)" <ji...@apache.org> on 2021/08/23 05:51:00 UTC

[jira] [Created] (ARROW-13690) [Python] Use IPC writing code for pickling RecordBatches

Micah Kornfield created ARROW-13690:
---------------------------------------

             Summary: [Python] Use IPC writing code for pickling RecordBatches
                 Key: ARROW-13690
                 URL: https://issues.apache.org/jira/browse/ARROW-13690
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Micah Kornfield


For wide schemas in particular, the recursive nature of the current pickling algorithm for record batches makes it less efficient than using the IPC format (which can be done entirely in C++).

 

Consider switching the mechanism to use the IPC format.  I think this can be a backwards-compatible change by leaving the current _reconstruct_record_batch in place, if we care about that.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)