You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Wes McKinney (Jira)" <ji...@apache.org> on 2020/03/05 00:55:00 UTC
[jira] [Created] (ARROW-8004) [Python] Define API for user-defined
conversions of array cell values in pyarrow.array
Wes McKinney created ARROW-8004:
-----------------------------------
Summary: [Python] Define API for user-defined conversions of array cell values in pyarrow.array
Key: ARROW-8004
URL: https://issues.apache.org/jira/browse/ARROW-8004
Project: Apache Arrow
Issue Type: Improvement
Components: Python
Reporter: Wes McKinney
Consider the statement
{code}
pyarrow.array([v0, v1, v2, v3])
{code}
or correspondingly
{code}
pyarrow.array(pd.Series([v0, v1, v2, v3], dtype=object))
{code}
where {{v0, ..., v4}} are instances of types with no built-in conversion-to-Arrow support in pyarrow. An API could be provided to allow user-defined unboxing to a data type that the library _does_ understand (like a NumPy array). One complexity is that if the unboxing is costly, we may need to "keep around" the unboxed value when doing multiple passes over the data (e.g. initially for type inference and then for conversion)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)