You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "GuangMing Lu (Jira)" <ji...@apache.org> on 2021/06/21 15:42:00 UTC

[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

     [ https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

GuangMing Lu updated HIVE-25269:
--------------------------------
    Attachment:     (was: test.sql)

> When the skew and parallel parameters are true simultaneously, the result is less data
> --------------------------------------------------------------------------------------
>
>                 Key: HIVE-25269
>                 URL: https://issues.apache.org/jira/browse/HIVE-25269
>             Project: Hive
>          Issue Type: Bug
>          Components: Physical Optimizer, SQL
>    Affects Versions: 3.1.0, 3.1.2
>            Reporter: GuangMing Lu
>            Priority: Major
>         Attachments: comb_classcode.data, 样例分析-表入数据.sql
>
>
> When the params of hive.optimize.skewjoin, hive.groupby.skewindata and hive.exec.parallel are true, and exec sql such as 'INSERT... FROM (SUBQUERY UNIONALL ...GROUP BY...) A JOIN/LEFT JOIN A.expression', result data will be reduced. Details of SQL and test data can be found in the attachment



--
This message was sent by Atlassian Jira
(v8.3.4#803005)