You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Alex Rodoni (JIRA)" <ji...@apache.org> on 2018/04/19 18:49:00 UTC

[jira] [Closed] (IMPALA-6867) Impala 2.12 & 3.0 Docs: Provide a query option to not shuffle on distinct exprs

     [ https://issues.apache.org/jira/browse/IMPALA-6867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alex Rodoni closed IMPALA-6867.
-------------------------------
       Resolution: Fixed
    Fix Version/s: Impala 2.12.0
                   Impala 3.0

> Impala 2.12 & 3.0 Docs: Provide a query option to not shuffle on distinct exprs
> -------------------------------------------------------------------------------
>
>                 Key: IMPALA-6867
>                 URL: https://issues.apache.org/jira/browse/IMPALA-6867
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Docs
>    Affects Versions: Impala 3.0, Impala 2.12.0
>            Reporter: Alex Rodoni
>            Assignee: Alex Rodoni
>            Priority: Major
>             Fix For: Impala 3.0, Impala 2.12.0
>
>
> https://gerrit.cloudera.org/#/c/9949/
> New query option:
>  SHUFFLE_DISTINCT_EXPRS
> This options controls the shuffling behavior when a query has both grouping and distinct exprs. Impala can optionally include the distinct exprs in the hash exchange of the first aggregation phase to spread the data among more nodes. However, this plan requires another hash exchange on the grouping exprs in the second phase which is not required when omitting the distinct exprs in the first phase. Turning it off is recommended if the NDVs of the grouping exprs is high.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)