Posted to github@arrow.apache.org by "jorisvandenbossche (via GitHub)" <gi...@apache.org> on 2023/06/30 12:30:08 UTC

[GitHub] [arrow] jorisvandenbossche commented on issue #36398: [Python] Add missing APIs to Table class

jorisvandenbossche commented on issue #36398:
URL: https://github.com/apache/arrow/issues/36398#issuecomment-1614584904

   Those might be intentional differences:
   
   - `_export_to_c` / `_import_from_c`: for RecordBatch, this exports the batch as an `ArrowArray`. That is not possible for a Table, whose columns can be chunked. In principle we could add ArrowArrayStream import/export to Table, but a Table's columns are not guaranteed to be consistently chunked, so this might add some complexity. That said, we already support converting a Table to a RecordBatchReader (which does support ArrowArrayStream), and that is essentially the same conversion.
   - `from_struct_array` / `to_struct_array` are specific to RecordBatch, because a StructArray and a RecordBatch are almost equivalent in memory layout, which is not necessarily true for a Table.
   - `serialize` writes a RecordBatch to a Buffer as an encapsulated IPC message. At the IPC spec level there is only a RecordBatch message; the concept of a Table doesn't exist (so there is no "table" message).

