Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2018/07/15 05:17:00 UTC

[jira] [Updated] (SPARK-24799) A solution of dealing with data skew in left,right,inner join

     [ https://issues.apache.org/jira/browse/SPARK-24799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated SPARK-24799:
------------------------------
    Target Version/s:   (was: 2.3.0)
       Fix Version/s:     (was: 2.3.0)

Have a quick look at [https://spark.apache.org/contributing.html] for guidance on filling out JIRAs. This one, yeah, is likely a duplicate of other general issues about skew.

> A solution of dealing with data skew in left,right,inner join
> -------------------------------------------------------------
>
>                 Key: SPARK-24799
>                 URL: https://issues.apache.org/jira/browse/SPARK-24799
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 2.0.0, 2.1.0, 2.2.0, 2.3.0
>            Reporter: marymwu
>            Priority: Major
>
> For left, right, and inner join execution, this solution mainly divides the partitions where data skew has occurred into several smaller partitions, so that more tasks can run in parallel and improve efficiency.
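
The description above does not say how the skewed partitions are split. One common way to get that effect is key salting, sketched below in plain Python for an inner join. This is an illustration under assumptions, not the reporter's implementation and not Spark API: the helper names (hash_join, salt_skewed_side, replicate_other_side, largest_group) and the parameters skewed_keys and num_salts are all hypothetical.

```python
from collections import Counter, defaultdict


def hash_join(left, right):
    """Plain inner hash join of (key, value) rows -> (key, left_val, right_val)."""
    index = defaultdict(list)
    for key, right_val in right:
        index[key].append(right_val)
    return [(key, left_val, right_val)
            for key, left_val in left
            for right_val in index.get(key, ())]


def salt_skewed_side(rows, skewed_keys, num_salts):
    """Spread rows of each skewed key over num_salts sub-keys (round-robin)."""
    seen = defaultdict(int)
    salted = []
    for key, val in rows:
        salt = 0
        if key in skewed_keys:
            salt = seen[key] % num_salts
            seen[key] += 1
        salted.append(((key, salt), val))
    return salted


def replicate_other_side(rows, skewed_keys, num_salts):
    """Copy rows of skewed keys once per salt so every sub-key finds its match."""
    replicated = []
    for key, val in rows:
        salts = range(num_salts) if key in skewed_keys else (0,)
        replicated.extend(((key, salt), val) for salt in salts)
    return replicated


def largest_group(rows):
    """Size of the biggest group of rows sharing one join key."""
    return max(Counter(key for key, _ in rows).values())


def salted_join(left, right, skewed_keys, num_salts):
    """Inner join that splits the skewed keys of `left` into smaller groups."""
    joined = hash_join(salt_skewed_side(left, skewed_keys, num_salts),
                       replicate_other_side(right, skewed_keys, num_salts))
    # Drop the salt so the output has the same shape as a plain join.
    return [(key, left_val, right_val)
            for (key, _salt), left_val, right_val in joined]


# Hypothetical data: key 1 is heavily skewed on the left side.
left = [(1, i) for i in range(1000)] + [(2, i) for i in range(10)]
right = [(1, "a"), (2, "b"), (3, "c")]
result = salted_join(left, right, skewed_keys={1}, num_salts=4)
```

Each salted sub-key can be handled by its own task, which is where the extra parallelism comes from. The cost is that the rows of the other side are copied num_salts times for every skewed key, so the salt count is a trade-off rather than a free parameter.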


