Posted to reviews@spark.apache.org by yucai <gi...@git.apache.org> on 2018/08/10 05:26:07 UTC

[GitHub] spark pull request #22066: [SPARK-25084][SQL] "distribute by" on multiple co...

GitHub user yucai opened a pull request:

    https://github.com/apache/spark/pull/22066

    [SPARK-25084][SQL] "distribute by" on multiple columns may lead to codegen issue

    ## What changes were proposed in this pull request?
    
    "distribute by" on multiple columns may lead to codegen issue
    
    ## How was this patch tested?
    
    UTs.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/yucai/spark SPARK-25084

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/22066.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #22066
    
----
commit 8ee56bbfaacdd64b1712d72650a39939ca3b13f2
Author: yucai <yy...@...>
Date:   2018-08-10T05:19:43Z

    "distribute by" on multiple columns may lead to codegen issue

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns m...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    Can one of the admins verify this patch?


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns m...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    **[Test build #94543 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/94543/testReport)** for PR 22066 at commit [`8ee56bb`](https://github.com/apache/spark/commit/8ee56bbfaacdd64b1712d72650a39939ca3b13f2).


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns (...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    Merged build finished. Test FAILed.


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns (...

Posted by yucai <gi...@git.apache.org>.

Github user yucai commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    @LantaoJin I realized the initial approach had some issues, so I marked it as WIP to refine it and add a test. It is different from your original implementation, so I would like to use this one.


---


[GitHub] spark issue #22066: [WIP][SPARK-25084][SQL] "distribute by" on multiple colu...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    can you add a test first?


---


[GitHub] spark issue #22066: [WIP][SPARK-25084][SQL] "distribute by" on multiple colu...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    Merged build finished. Test FAILed.


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns (...

Posted by yucai <gi...@git.apache.org>.

Github user yucai commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    @cloud-fan @jerryshao sure, I will do it.


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns m...

Posted by yucai <gi...@git.apache.org>.

Github user yucai commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    @cloud-fan @gatorsmile The PR is ready; please help review.


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns (...

Posted by yucai <gi...@git.apache.org>.

Github user yucai commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    @cloud-fan Jira and 1st is from this one. It is critical to our 2.3 migration.


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns m...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    **[Test build #94557 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/94557/testReport)** for PR 22066 at commit [`931fa28`](https://github.com/apache/spark/commit/931fa28861f15ef1c31a51787f3bd59f2284de89).


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns (...

Posted by LantaoJin <gi...@git.apache.org>.

Github user LantaoJin commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    Thank you @yucai. New PR #22077 for branch-2.3. Cc: @cloud-fan @jerryshao


---


[GitHub] spark issue #22066: [SPARK-25084][SQL] "distribute by" on multiple columns (...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/22066
  
    Merged build finished. Test FAILed.


---
