You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Uwe L. Korn (JIRA)" <ji...@apache.org> on 2016/03/30 09:07:25 UTC

[jira] [Commented] (ARROW-86) Python: Implement zero-copy Arrow-to-Pandas conversion

    [ https://issues.apache.org/jira/browse/ARROW-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15217539#comment-15217539 ] 

Uwe L. Korn commented on ARROW-86:
----------------------------------

No need to subclass {{ndarray}}, we can simply set the reference via {{PyArray_SetBaseObject}}. 

The only remaining problem is now that {{PyArray_SimpleNewFromData}} expects a {{void*}} pointer but our data is {{const}}.

> Python: Implement zero-copy Arrow-to-Pandas conversion
> ------------------------------------------------------
>
>                 Key: ARROW-86
>                 URL: https://issues.apache.org/jira/browse/ARROW-86
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Uwe L. Korn
>
> For this to work, we need:
>  * a check if the Arrow Array could be represented as Pandas Series using the same memory buffer
>  * a {{numpy.ndarray}} subclass that keeps a reference to the Arrow Array
> With the NumPy C-API it is already possible to create a {{numpy.ndarray}} instance that uses the Arrow Array memory buffer internally but there is no way to keep track if the {{numpy.ndarray}} still lives or the underlying memory can be free'd.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)