You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Tor Eivind McKenzie-Syvertsen (Jira)" <ji...@apache.org> on 2021/08/20 12:34:00 UTC
[jira] [Created] (ARROW-13681) pyarow.compute.list_parent_indices
only computes for first chunk
Tor Eivind McKenzie-Syvertsen created ARROW-13681:
-----------------------------------------------------
Summary: pyarow.compute.list_parent_indices only computes for first chunk
Key: ARROW-13681
URL: https://issues.apache.org/jira/browse/ARROW-13681
Project: Apache Arrow
Issue Type: Bug
Reporter: Tor Eivind McKenzie-Syvertsen
I came across this issue due to very unexpected behaviour from the "explode" function obtained here:
https://issues.apache.org/jira/browse/ARROW-12099
indices = pc.list_parent_indices(table[col_name])
if table[column] in this example contains several chunks, the indices will look perfectly fine for that chunk, but erratic and unexpected results for second chunk.
No warning or info was given either
A workaround that solved the problem for me is:
{code:java}
indices = pc.list_parent_indices(table.combine_chunks()[col_name])
{code}
The behaviour then changes dramatically.
I'm assuming this isnt expected and should be fixed?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)