Posted to issues@arrow.apache.org by "Joris Van den Bossche (JIRA)" <ji...@apache.org> on 2019/07/04 21:33:00 UTC

[jira] [Created] (ARROW-5857) [Python] converting multidimensional numpy arrays to nested list type

Joris Van den Bossche created ARROW-5857:
--------------------------------------------

             Summary: [Python] converting multidimensional numpy arrays to nested list type
                 Key: ARROW-5857
                 URL: https://issues.apache.org/jira/browse/ARROW-5857
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Joris Van den Bossche


Currently we only support 1-dimensional numpy arrays:

{code:python}
In [28]: pa.array([np.array([[1, 2], [3, 4]])], type=pa.list_(pa.list_(pa.int64()))) 
...
ArrowInvalid: Can only convert 1-dimensional array values
{code}

So to create a nested list array today, you have to pass lists of lists, or object-dtype numpy arrays whose elements are themselves arrays. We could extend this support to multi-dimensional numpy arrays.

I am not sure we should infer the nested type by default in this case, but the conversion should at least work when a nested ListType is specified explicitly.

This could be an alternative way to provide some support for tensors, alongside an ExtensionType (ARROW-1614 / ARROW-5819).

Related discussions: https://lists.apache.org/thread.html/9b142c1709aa37dc35f1ce8db4e1ced94fcc4cdd96cc72b5772b373b@%3Cdev.arrow.apache.org%3E, https://github.com/apache/arrow/issues/4802


