You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Tor Eivind McKenzie-Syvertsen (Jira)" <ji...@apache.org> on 2021/08/20 12:34:00 UTC

[jira] [Created] (ARROW-13681) pyarow.compute.list_parent_indices only computes for first chunk

Tor Eivind McKenzie-Syvertsen created ARROW-13681:
-----------------------------------------------------

             Summary:  pyarow.compute.list_parent_indices only computes for first chunk
                 Key: ARROW-13681
                 URL: https://issues.apache.org/jira/browse/ARROW-13681
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: Tor Eivind McKenzie-Syvertsen


I came across this issue due to very unexpected behaviour from the "explode" function obtained here:
https://issues.apache.org/jira/browse/ARROW-12099
indices = pc.list_parent_indices(table[col_name])

if table[column] in this example contains several chunks, the indices will look perfectly fine for that chunk, but erratic and unexpected results for second chunk.
No warning or info was given either

A workaround that solved the problem for me is:
{code:java}
  indices = pc.list_parent_indices(table.combine_chunks()[col_name])
{code}
The behaviour then changes dramatically.

I'm assuming this isnt expected and should be fixed?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)