You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Chenxi Li (Jira)" <ji...@apache.org> on 2021/06/03 11:03:00 UTC
[jira] [Created] (ARROW-12942) [C++] The result of Arrow compute
hash_min_max is incorrect if there are new groups in the subsequent chunks
Chenxi Li created ARROW-12942:
---------------------------------
Summary: [C++] The result of Arrow compute hash_min_max is incorrect if there are new groups in the subsequent chunks
Key: ARROW-12942
URL: https://issues.apache.org/jira/browse/ARROW-12942
Project: Apache Arrow
Issue Type: Bug
Affects Versions: 4.0.1, 4.0.0
Reporter: Chenxi Li
Assignee: Chenxi Li
If there are new groups in the subsequent chunks of a chunked array, the result of Arrow compute hash_min_max is incorrect.
For example, a table with two chunks, the second chunk has a new group key
{code:java}
First chunk: {"argument": 1, "key": 0},
Second chunk: {"argument": 0, "key": 1}
{code}
the result of hash_min_max by "key" with such data is
{code:java}
[{"min": null, "max": null}, 0],
[{"min": 0, "max": 0}, 1]
{code}
But it should be
{code:java}
[{"min": 1, "max": 1}, 0],
[{"min": 0, "max": 0}, 1]
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)