You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2022/12/14 14:40:00 UTC
[jira] [Assigned] (ARROW-18423) [Python] Expose reading a schema from an IPC message
[ https://issues.apache.org/jira/browse/ARROW-18423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Antoine Pitrou reassigned ARROW-18423:
--------------------------------------
Assignee: Andre Kohn
> [Python] Expose reading a schema from an IPC message
> ----------------------------------------------------
>
> Key: ARROW-18423
> URL: https://issues.apache.org/jira/browse/ARROW-18423
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Reporter: Andre Kohn
> Assignee: Andre Kohn
> Priority: Major
> Labels: pull-request-available
> Fix For: 11.0.0
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> Pyarrow currently does not implement the reading of an Arrow schema from an IPC message.
> [https://github.com/apache/arrow/blob/80b389efe902af376a85a8b3740e0dbdc5f80900/python/pyarrow/ipc.pxi#L1094]
>
> We'd like to consume an Arrow IPC stream like the following:
>
> {code:java}
> schema_msg = pyarrow.ipc.read_message(result_iter.next().data)
> schema = pyarrow.ipc.read_schema(schema_msg)
> for batch_data in result_iter:
> batch_msg = pyarrow.ipc.read_message(batch_data.data)
> batch = pyarrow.ipc.read_record_batch(batch_msg, schema){code}
>
> The associated (tiny) PR on GitHub implements this reading by binding the existing C++ function.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)