Posted to issues@arrow.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2019/06/11 20:00:06 UTC

[jira] [Commented] (ARROW-840) [Python] Provide Python API for creating user-defined data types that can survive Arrow IPC

    [ https://issues.apache.org/jira/browse/ARROW-840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861428#comment-16861428 ] 

Wes McKinney commented on ARROW-840:
------------------------------------

[~jorisvandenbossche] [~pitrou] what do you think is the path ahead for this project? There are two distinct areas of work:

* Defining extension types in Python and exposing extension types already defined in C++ to Python
* Bridging between extension types (which may be C++ or Python-defined) and pandas

This issue, ARROW-840, covers the former but not the latter. It would be nice to have this feature available in 0.14.0, but given today's date (June 11) I'm concerned that we may miss the window for that release.

> [Python] Provide Python API for creating user-defined data types that can survive Arrow IPC
> -------------------------------------------------------------------------------------------
>
>                 Key: ARROW-840
>                 URL: https://issues.apache.org/jira/browse/ARROW-840
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: Python
>            Reporter: Wes McKinney
>            Priority: Major
>             Fix For: 0.14.0
>
>
> The user will provide:
> * Data type subclass that can indicate the physical storage type
> * "get state" and "set state" functions for serializing custom metadata to bytes
> * An optional function for "boxing" scalar values from the physical array storage
> Internally, this will build on an analogous C++ API for defining user data types
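
The three pieces listed in the description above might look roughly like the sketch below. This is not the pyarrow API and it does not touch Arrow at all: `ExtensionTypeSketch`, `UuidType`, `get_state`, `set_state`, `box`, and the use of a string to name the storage type are all hypothetical names chosen for illustration. The sketch only shows how a type subclass could declare its physical storage, round-trip its custom metadata through bytes, and box a raw storage value into a Python object.

```python
import json
import uuid


class ExtensionTypeSketch:
    """Hypothetical base class (not pyarrow's API): the shape described in the issue."""

    def __init__(self, storage_type):
        # Name of the physical storage type, kept as a plain string in this sketch.
        self.storage_type = storage_type

    def get_state(self) -> bytes:
        """Serialize custom metadata to bytes; the default carries no metadata."""
        return b""

    @classmethod
    def set_state(cls, storage_type, state: bytes):
        """Rebuild a type instance from the storage type and serialized metadata."""
        raise NotImplementedError

    def box(self, raw):
        """Optionally wrap a raw storage value; the default returns it unchanged."""
        return raw


class UuidType(ExtensionTypeSketch):
    """Hypothetical user-defined type stored as 16-byte fixed-size binary."""

    STORAGE = "fixed_size_binary(16)"

    def __init__(self, version=4):
        super().__init__(self.STORAGE)
        self.version = version

    def get_state(self) -> bytes:
        # Custom metadata is encoded as JSON, then as UTF-8 bytes.
        return json.dumps({"version": self.version}).encode("utf-8")

    @classmethod
    def set_state(cls, storage_type, state: bytes):
        if storage_type != cls.STORAGE:
            raise ValueError(f"unexpected storage type: {storage_type}")
        return cls(**json.loads(state.decode("utf-8")))

    def box(self, raw):
        # Turn the 16 raw bytes from storage into a uuid.UUID scalar.
        return uuid.UUID(bytes=raw)


# Round trip: serialize the type's metadata, rebuild the type, then box a value.
original = UuidType(version=4)
state = original.get_state()
restored = UuidType.set_state(original.storage_type, state)

raw = uuid.UUID("12345678-1234-5678-1234-567812345678").bytes
boxed = restored.box(raw)
```

In this sketch the serialized state and the storage type name are the only things that would need to travel with the data for the receiving side to reconstruct the type; how that is actually carried over Arrow IPC is left to the C++ API mentioned above.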


