Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2018/07/15 05:18:00 UTC

[jira] [Resolved] (SPARK-24799) A solution of dealing with data skew in left,right,inner join

     [ https://issues.apache.org/jira/browse/SPARK-24799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen resolved SPARK-24799.
-------------------------------
    Resolution: Duplicate

> A solution of dealing with data skew in left,right,inner join
> -------------------------------------------------------------
>
>                 Key: SPARK-24799
>                 URL: https://issues.apache.org/jira/browse/SPARK-24799
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 2.0.0, 2.1.0, 2.2.0, 2.3.0
>            Reporter: marymwu
>            Priority: Major
>
> For the execution of left, right, and inner join statements, this solution mainly divides the partitions where data skew has occurred into several smaller partitions, so that more tasks can run in parallel and efficiency improves.
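
The description above gives no implementation details, so the following is only a rough sketch of one common way to split a skewed join partition: key salting. It is plain Python rather than Spark code, and it covers only the inner join case. The names salted_inner_join, skewed_keys, and NUM_SALTS are hypothetical and do not come from the issue; the reporter's actual approach may differ.

```python
import random
from collections import defaultdict

NUM_SALTS = 4  # hypothetical: number of sub-partitions per skewed key


def salted_inner_join(left, right, skewed_keys, num_salts=NUM_SALTS, seed=0):
    """Inner-join two lists of (key, value) pairs, spreading skewed keys
    over num_salts buckets so no single bucket holds all of their rows."""
    rng = random.Random(seed)

    # Skewed side: attach a random salt to every row of a skewed key.
    left_buckets = defaultdict(list)
    for key, value in left:
        salt = rng.randrange(num_salts) if key in skewed_keys else 0
        left_buckets[(key, salt)].append(value)

    # Other side: replicate rows of skewed keys once per salt, so each
    # salted bucket on the left still finds all of its matches.
    right_buckets = defaultdict(list)
    for key, value in right:
        salts = range(num_salts) if key in skewed_keys else (0,)
        for salt in salts:
            right_buckets[(key, salt)].append(value)

    # Each (key, salt) bucket is an independent unit of work (one "task").
    joined = []
    for (key, salt), left_values in left_buckets.items():
        for left_value in left_values:
            for right_value in right_buckets.get((key, salt), []):
                joined.append((key, left_value, right_value))

    bucket_sizes = {bucket: len(vals) for bucket, vals in left_buckets.items()}
    return joined, bucket_sizes
```

The join result is the same as an unsalted inner join, but the rows of a skewed key are spread over up to NUM_SALTS buckets instead of one. The cost is that the matching rows on the other side are replicated once per salt, so this only pays off when that side is small for the skewed keys.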


