Posted to dev@hive.apache.org by "Jesus Camacho Rodriguez (JIRA)" <ji...@apache.org> on 2016/06/26 02:14:37 UTC
[jira] [Created] (HIVE-14096) Extend RS dedup logic to merge GBy operators
Jesus Camacho Rodriguez created HIVE-14096:
----------------------------------------------
Summary: Extend RS dedup logic to merge GBy operators
Key: HIVE-14096
URL: https://issues.apache.org/jira/browse/HIVE-14096
Project: Hive
Issue Type: Bug
Components: Physical Optimizer
Affects Versions: 2.2.0
Reporter: Jesus Camacho Rodriguez
Since we always generate a map-side GBy at plan generation time, there are occasions when we could collapse the GBy operators after the RS dedup optimization. The GBy would then be executed in a single stage with {{mode = complete}}.
Example in {{reduce_deduplicate_extended2.q.out}}:
{noformat}
SELECT f.key, g.value
FROM src f
JOIN src g ON (f.key = g.key AND f.value = g.value)
GROUP BY g.value, f.key
{noformat}
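To illustrate why the collapse is safe for a query like this one, here is a toy Python model. It is not Hive code: the helper names ({{shuffle}}, {{gby_complete}}), the sample rows, and the use of a row count as the aggregate are all assumptions made for the sketch. The idea it shows is that once the join's RS has partitioned rows on (key, value), every group of (g.value, f.key) already sits on one reducer, so a single complete-mode GBy per reducer gives the final answer with no partial/merge pair.

```python
from collections import Counter

def shuffle(rows, key, num_reducers):
    """Model an RS: route each row to a reducer by hashing its key."""
    parts = [[] for _ in range(num_reducers)]
    for row in rows:
        parts[hash(key(row)) % num_reducers].append(row)
    return parts

def gby_complete(rows, key):
    """Model a GBy in complete mode: aggregate raw rows in one pass."""
    return Counter(key(row) for row in rows)

# Hypothetical join output rows of the form (f.key, g.value).
joined = [("a", "x"), ("a", "x"), ("b", "y"),
          ("c", "z"), ("b", "y"), ("a", "w")]

join_key = lambda r: (r[0], r[1])    # columns the join's RS partitions on
group_key = lambda r: (r[1], r[0])   # GROUP BY g.value, f.key

# The join already shuffled the rows; run one GBy per reducer on top of it.
partitions = shuffle(joined, join_key, 3)
per_reducer = [gby_complete(p, group_key) for p in partitions]

# No group is split across reducers, so concatenating the per-reducer
# results is already the final result.
merged = Counter()
for counts in per_reducer:
    assert not (merged.keys() & counts.keys())
    merged.update(counts)

assert merged == gby_complete(joined, group_key)
```

The same argument does not hold when the GBy keys are not covered by the RS partitioning columns; in that case a second shuffle, and hence a two-stage GBy, would still be needed.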