You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Maarten Breddels (Jira)" <ji...@apache.org> on 2020/12/03 18:14:00 UTC

[jira] [Created] (ARROW-10799) [C++] Take on string chunked arrays slow and fails

Maarten Breddels created ARROW-10799:
----------------------------------------

             Summary: [C++] Take on string chunked arrays slow and fails
                 Key: ARROW-10799
                 URL: https://issues.apache.org/jira/browse/ARROW-10799
             Project: Apache Arrow
          Issue Type: Bug
          Components: C++
            Reporter: Maarten Breddels


 
{code:java}
import pyarrow as pa
a = pa.array(['a'] * 2**26)
c = pa.chunked_array([a] * 2*18)
c.take([0, 1])
{code}
Gives
{noformat}
----------------------------------------
ArrowInvalidTraceback (most recent call last)
<ipython-input-4-57099ee02815> in <module>
----> 1 c.take([0, 1])

~/github/apache/arrow/python/pyarrow/table.pxi in pyarrow.lib.ChunkedArray.take()

~/github/apache/arrow/python/pyarrow/compute.py in take(data, indices, boundscheck, memory_pool)
    421     """
    422     options = TakeOptions(boundscheck=boundscheck)
--> 423     return call_function('take', [data, indices], options, memory_pool)
    424 
    425 

~/github/apache/arrow/python/pyarrow/_compute.pyx in pyarrow._compute.call_function()

~/github/apache/arrow/python/pyarrow/_compute.pyx in pyarrow._compute.Function.call()

~/github/apache/arrow/python/pyarrow/error.pxi in pyarrow.lib.pyarrow_internal_check_status()

~/github/apache/arrow/python/pyarrow/error.pxi in pyarrow.lib.check_status()

ArrowInvalid: offset overflow while concatenating arrays
{noformat}
 

PS: did not check master but  3.0.0.dev238+gb0bc9f8d

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)