You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Wes McKinney (Jira)" <ji...@apache.org> on 2020/03/05 00:55:00 UTC

[jira] [Created] (ARROW-8004) [Python] Define API for user-defined conversions of array cell values in pyarrow.array

Wes McKinney created ARROW-8004:
-----------------------------------

             Summary: [Python] Define API for user-defined conversions of array cell values in pyarrow.array
                 Key: ARROW-8004
                 URL: https://issues.apache.org/jira/browse/ARROW-8004
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Wes McKinney


Consider the statement

{code}
pyarrow.array([v0, v1, v2, v3])
{code}

or correspondingly

{code}
pyarrow.array(pd.Series([v0, v1, v2, v3], dtype=object))
{code}

where {{v0, ..., v4}} are instances of types with no built-in conversion-to-Arrow support in pyarrow. An API could be provided to allow user-defined unboxing to a data type that the library _does_ understand (like a NumPy array). One complexity is that if the unboxing is costly, we may need to "keep around" the unboxed value when doing multiple passes over the data (e.g. initially for type inference and then for conversion)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)