You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2017/04/14 19:57:42 UTC
[jira] [Created] (ARROW-823) [Python] Devise a means to serialize
arrays of arbitrary Python objects in Arrow IPC messages
Wes McKinney created ARROW-823:
----------------------------------
Summary: [Python] Devise a means to serialize arrays of arbitrary Python objects in Arrow IPC messages
Key: ARROW-823
URL: https://issues.apache.org/jira/browse/ARROW-823
Project: Apache Arrow
Issue Type: New Feature
Components: Python
Reporter: Wes McKinney
Practically speaking, this would involve a "custom" logical type that is "pyobject", represented physically as an array of 64-bit pointers. On serialization, this would need to be converted to a BinaryArray containing pickled objects as binary values
At the moment, we don't yet have the machinery to deal with "custom" types where the in-memory representation is different from the on-wire representation. This would be a useful use case to work through the design issues
Interestingly, if done properly, this would enable other Arrow implementations to manipulate (filter, etc.) serialized Python objects as binary blobs.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)