You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Wenjun Si (Jira)" <ji...@apache.org> on 2020/03/15 01:19:00 UTC

[jira] [Created] (ARROW-8122) [Python] Empty numpy arrays with shape cannot be obtained in Plasma

Wenjun Si created ARROW-8122:
--------------------------------

             Summary: [Python] Empty numpy arrays with shape cannot be obtained in Plasma
                 Key: ARROW-8122
                 URL: https://issues.apache.org/jira/browse/ARROW-8122
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.16.0
            Reporter: Wenjun Si


In PyArrow 0.16.0, when we try to store an empty Numpy Array with shape into Plasma Store, for instance, np.array([[], []]), the put call works, but when we try to get it from Plasma Store, an ArrowInvalid is raised.

Code reproducing this error:

{code:python}
import numpy as np
import pyarrow.plasma
store_ctx = pyarrow.plasma.start_plasma_store(16 * 1024 * 1024)
store_vars = store_ctx.__enter__()
cli = pyarrow.plasma.connect(store_vars[0])
arr = np.array([[], []])
obj_id = cli.put(arr)
st_obj = cli.get(obj_id)  # this line cannot work
{code}

and the error stack is

{code:python}
Traceback (most recent call last):
  File "/Users/wenjun/miniconda3/lib/python3.7/site-packages/IPython/core/interactiveshell.py", line 3326, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)
  File "<ipython-input-9-463596c9dc4f>", line 1, in <module>
    st_obj = cli.get(obj_id)
  File "pyarrow/_plasma.pyx", line 580, in pyarrow._plasma.PlasmaClient.get
  File "pyarrow/_plasma.pyx", line 573, in pyarrow._plasma.PlasmaClient.get
  File "pyarrow/serialization.pxi", line 476, in pyarrow.lib.deserialize
  File "pyarrow/serialization.pxi", line 438, in pyarrow.lib.deserialize_from
  File "pyarrow/serialization.pxi", line 414, in pyarrow.lib.read_serialized
  File "pyarrow/error.pxi", line 84, in pyarrow.lib.check_status
pyarrow.lib.ArrowInvalid: strides must not involve buffer over run
{code}

The same code works in PyArrow 0.15.x




--
This message was sent by Atlassian Jira
(v8.3.4#803005)