You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "GuangMing Lu (Jira)" <ji...@apache.org> on 2021/06/21 15:42:00 UTC
[jira] [Updated] (HIVE-25269) When the skew and parallel parameters
are true simultaneously, the result is less data
[ https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
GuangMing Lu updated HIVE-25269:
--------------------------------
Attachment: (was: test.sql)
> When the skew and parallel parameters are true simultaneously, the result is less data
> --------------------------------------------------------------------------------------
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: comb_classcode.data, 样例分析-表入数据.sql
>
>
> When the params of hive.optimize.skewjoin, hive.groupby.skewindata and hive.exec.parallel are true, and exec sql such as 'INSERT... FROM (SUBQUERY UNIONALL ...GROUP BY...) A JOIN/LEFT JOIN A.expression', result data will be reduced. Details of SQL and test data can be found in the attachment
--
This message was sent by Atlassian Jira
(v8.3.4#803005)