You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Weston Pace (Jira)" <ji...@apache.org> on 2022/10/31 19:00:00 UTC
[jira] [Created] (ARROW-18205) [C++] Substrait consumer is not converting right side references correctly on joins
Weston Pace created ARROW-18205:
-----------------------------------
Summary: [C++] Substrait consumer is not converting right side references correctly on joins
Key: ARROW-18205
URL: https://issues.apache.org/jira/browse/ARROW-18205
Project: Apache Arrow
Issue Type: Bug
Components: C++
Reporter: Weston Pace
The Substrait plan expresses a join condition as a logical expression like:
{{field(0) == field(3)}} where {{0}} and {{3}} are indices into the *combined* schema. These are then passed down to Acero which expects:
{{HashJoinNodeOptions(std::vector<FieldRef> in_left_keys, std::vector<FieldRef> in_right_keys)}}
However, {{in_left_keys}} are field references into the *left* schema and {{in_right_keys}} are field references into the *right* schema.
In other words, given the above expression ({{field(0) == field(3)}} if the schema were:
left:
key: int32
y: int32
z: int32
right:
key: int32
x: int32
Then {{in_left_keys}} should be {{field(0)}} (works correct today) and {{in_right_keys}} should be {{field(0)}} (today we are sending in {{field(3)}}).
--
This message was sent by Atlassian Jira
(v8.20.10#820010)