You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2021/05/07 12:28:00 UTC

[jira] [Updated] (IMPALA-9951) Skew in analytic sorts when partition key has low cardinality

     [ https://issues.apache.org/jira/browse/IMPALA-9951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Quanlong Huang updated IMPALA-9951:
-----------------------------------
    Target Version: Impala 4.1  (was: Impala 4.0)

> Skew in analytic sorts when partition key has low cardinality
> -------------------------------------------------------------
>
>                 Key: IMPALA-9951
>                 URL: https://issues.apache.org/jira/browse/IMPALA-9951
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Frontend
>            Reporter: Tim Armstrong
>            Priority: Major
>              Labels: multithreading, tpcds
>
> In queries like TPC-DS Q67, the cardinality of the PARTITION BY expression of the analytic may be much lower than the parallelism of the input fragment. In this case the runtime of the sort can be skewed. We could mitigate the problem by doing the expensive sort *before* the exchange, so that the analytic fragment only needs to merge together its sorted input and evaluate the analytic over it.
> The impact of this is greater with multithreading, so I am considering only change the default when mt_dop > 0



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org