Posted to reviews@spark.apache.org by gvramana <gi...@git.apache.org> on 2015/02/09 08:32:04 UTC

[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

GitHub user gvramana opened a pull request:

    https://github.com/apache/spark/pull/4466

    [SPARK-5680][SQL] Sum function on all null values, should return zero

    SELECT sum('a'), avg('a'), variance('a'), std('a') FROM src;
    Should give output as 
    0.0	NULL	NULL	NULL
    This fixes hive udaf_number_format.q
    
    Author: Venkata Ramana G <ra...@huawei.com>

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gvramana/spark sum_fix

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/4466.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #4466
    
----
commit bee6882f07d8ebd0f177f1dbdf315b4428554aa2
Author: Venkata Ramana Gollamudi <ra...@huawei.com>
Date:   2015-02-06T08:52:32Z

    fix sum function for all null data

commit d4287c8c211e4d88e643e2b2e4a9146f3bda23bd
Author: Venkata Ramana Gollamudi <ra...@huawei.com>
Date:   2015-02-09T07:11:25Z

    Add udaf_number_format to whitelist and golden answer

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-79517401
  
    Perhaps I am missing something, but why do we need a new `CombineSumFunction`?  Can we not just fix the semantics of `SumFunction`?




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-83982500
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28918/
    Test PASSed.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-74500153
  
    @marmbrus, please review and merge. Thanks.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73853989
  
      [Test build #27287 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27287/consoleFull) for   PR 4466 at commit [`94d4ec6`](https://github.com/apache/spark/commit/94d4ec69a5ff7dbe7b025eb8f758cec5eafe2d05).
     * This patch merges cleanly.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-83982493
  
      [Test build #28918 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28918/consoleFull) for   PR 4466 at commit [`242ef74`](https://github.com/apache/spark/commit/242ef745d441063e691c44b6f99aa636688f6aa6).
     * This patch **passes all tests**.
     * This patch **does not merge cleanly**.
     * This patch adds the following public classes _(experimental)_:
      * `case class CombineSum(child: Expression) extends AggregateExpression `
      * `case class CombineSumFunction(expr: Expression, base: AggregateExpression)`





[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-85063529
  
    @yhuai, I submitted PR #5138 for code gen with Sum. Thanks.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73863265
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27287/
    Test PASSed.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by yhuai <gi...@git.apache.org>.
Github user yhuai commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-84463533
  
    @gvramana Can you make a PR to make code gen work with Sum? We need to handle `CombineSum` there. `canBeCodeGened` in the `HashAggregation` strategy and `GeneratedAggregate` are the places to update. Thanks.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-84449808
  
    Thanks!  Merged into master and 1.3.
    
    BTW, the author tag is automatically added by our merge script so you don't need to put it in the description of the PR :)




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73619385
  
      [Test build #27139 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27139/consoleFull) for   PR 4466 at commit [`d4287c8`](https://github.com/apache/spark/commit/d4287c8c211e4d88e643e2b2e4a9146f3bda23bd).
     * This patch **fails Scala style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73467653
  
    Can one of the admins verify this patch?




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-84071542
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/28923/
    Test PASSed.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-84071518
  
      [Test build #28923 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28923/consoleFull) for   PR 4466 at commit [`42e14d1`](https://github.com/apache/spark/commit/42e14d15eacda30d032e8d913e9edd0718c8e45f).
     * This patch **passes all tests**.
    
     * This patch merges cleanly.
     * This patch adds the following public classes _(experimental)_:
      * `case class CombineSum(child: Expression) extends AggregateExpression `
      * `case class CombineSumFunction(expr: Expression, base: AggregateExpression)`





[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-84052263
  
      [Test build #28923 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28923/consoleFull) for   PR 4466 at commit [`42e14d1`](https://github.com/apache/spark/commit/42e14d15eacda30d032e8d913e9edd0718c8e45f).
     * This patch merges cleanly.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-74011625
  
    A case class cannot be derived from another case class. Making it a non-case class would break the uniformity with the other aggregate functions, so I left them as they are.
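
One way to share code without case-to-case inheritance is a common abstract base with two sibling case classes. The following is a minimal sketch under assumptions, not the code in this PR: `SumLike`, `SumFn`, `CombineSumFn`, the `column` parameter, and the `Option[Double]` modeling of SQL NULL are all hypothetical names chosen for illustration, and the two `update` bodies are an assumed reading of the difference discussed in this thread.

```scala
// Hypothetical stand-ins for illustration; Spark's real aggregate function API differs.
abstract class SumLike {
  // None models SQL NULL (nothing accumulated yet).
  protected var acc: Option[Double] = None

  protected def add(v: Double): Unit = {
    acc = Some(acc.getOrElse(0.0) + v)
  }

  // The only member the two variants implement differently.
  def update(input: Option[Double]): Unit

  def result: Option[Double] = acc
}

// Scala prohibits a case class from extending another case class,
// so both variants extend the shared abstract base instead.
final case class SumFn(column: String) extends SumLike {
  // Assumption: every input row counts as "seen", so all-null input yields 0.0.
  def update(input: Option[Double]): Unit = add(input.getOrElse(0.0))
}

final case class CombineSumFn(column: String) extends SumLike {
  // Assumption: a null partial result means "no data" and is skipped.
  def update(input: Option[Double]): Unit = input.foreach(add)
}
```

With this layout both variants remain case classes, at the cost of one extra abstract type.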




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by yhuai <gi...@git.apache.org>.
Github user yhuai commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-84509403
  
    I have created https://issues.apache.org/jira/browse/SPARK-6451.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-84809539
  
    I will handle it, thanks.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73854747
  
    The original fix could not distinguish the partial sum expression from the no-partitions case. This is fixed now. @marmbrus, @rxin, please review and test. Thanks.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73619276
  
      [Test build #27139 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27139/consoleFull) for   PR 4466 at commit [`d4287c8`](https://github.com/apache/spark/commit/d4287c8c211e4d88e643e2b2e4a9146f3bda23bd).
     * This patch merges cleanly.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73618901
  
    ok to test




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73653867
  
    Fixed. Thanks. Please retest.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-84051472
  
    Rebased




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73863258
  
      [Test build #27287 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27287/consoleFull) for   PR 4466 at commit [`94d4ec6`](https://github.com/apache/spark/commit/94d4ec69a5ff7dbe7b025eb8f758cec5eafe2d05).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds the following public classes _(experimental)_:
      * `case class CombineSum(child: Expression) extends AggregateExpression `
      * `case class CombineSumFunction(expr: Expression, base: AggregateExpression)`





[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-83958070
  
      [Test build #28918 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/28918/consoleFull) for   PR 4466 at commit [`242ef74`](https://github.com/apache/spark/commit/242ef745d441063e691c44b6f99aa636688f6aa6).
     * This patch **does not merge cleanly**.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73619386
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27139/
    Test FAILed.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-83957817
  
    Updated with comments




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73654060
  
      [Test build #27183 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27183/consoleFull) for   PR 4466 at commit [`521eb03`](https://github.com/apache/spark/commit/521eb036ff079666864325c52d4cc540f335cbe7).
     * This patch merges cleanly.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-81994981
  
    Can you add this explanation to `CombineSumFunction`'s scala doc?




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73654932
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27183/
    Test FAILed.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73654924
  
      [Test build #27183 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27183/consoleFull) for   PR 4466 at commit [`521eb03`](https://github.com/apache/spark/commit/521eb036ff079666864325c52d4cc540f335cbe7).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by rxin <gi...@git.apache.org>.
Github user rxin commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73628036
  
    The patch is failing style checker. @gvramana can you update it?




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/4466




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by adrian-wang <gi...@git.apache.org>.
Github user adrian-wang commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-73971304
  
    The only difference between `CombineSumFunction` and `SumFunction` is in `update`. Can we just derive `CombineSumFunction` from `SumFunction` and override the `update` function?




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by adrian-wang <gi...@git.apache.org>.
Github user adrian-wang commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-74014984
  
    That makes sense, thanks!




[GitHub] spark pull request: [SPARK-5680][SQL] Sum function on all null val...

Posted by gvramana <gi...@git.apache.org>.
Github user gvramana commented on the pull request:

    https://github.com/apache/spark/pull/4466#issuecomment-81427255
  
    We need to satisfy 3 cases:
    1) Sum of all null values = zero
    2) Sum of a table column with no data = null
    3) Sum of a column with null and non-null values = sum of the non-null values
    
    i.e.,
    
    Combining level | Partition(s) level | Input data
    ----------------+--------------------+-------------
    Zero            | <-- Zero (f1)      | <-- null
                    |                    | <-- null
                    |                    | <-- null
    null (f2)       | <-- null           | <-- No data
                    | <-- null           | <-- No data
                    | <-- null           | <-- No data
    
    If the same aggregate expression has to work at both the partition level and the combining level, it cannot distinguish between the f1 and f2 cases, because the combining expression has no way to detect the "No data" case.
    So the expressions are separated.
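
The three cases above can be modeled roughly as follows. This is a minimal sketch, not the implementation in this PR: `SumSemantics`, `partialSum`, and `combineSum` are hypothetical names, and `Option[Double]` is an assumed stand-in for a nullable SQL value (`None` for NULL).

```scala
// Hypothetical model of the two aggregation levels; not Spark's actual classes.
object SumSemantics {
  // Partition level: each element is one input row, None models a SQL NULL value.
  def partialSum(rows: Seq[Option[Double]]): Option[Double] =
    if (rows.isEmpty) None        // no data in this partition => null
    else Some(rows.flatten.sum)   // nulls are skipped; all-null rows => 0.0

  // Combining level: each element is one partition's partial result.
  // None here means "that partition had no data", so it must not produce zero.
  def combineSum(partials: Seq[Option[Double]]): Option[Double] = {
    val present = partials.flatten
    if (present.isEmpty) None else Some(present.sum)
  }

  def main(args: Array[String]): Unit = {
    // Case 1: all null values => zero
    println(combineSum(Seq(partialSum(Seq(None, None, None)))))
    // Case 2: no data in any partition => null
    println(combineSum(Seq(partialSum(Seq.empty), partialSum(Seq.empty))))
    // Case 3: mixed null and non-null values => sum of the non-null values
    println(combineSum(Seq(partialSum(Seq(Some(1.0), None, Some(2.0))))))
  }
}
```

In this model the same input, a sequence of three nulls, gives zero from `partialSum` and null from `combineSum`, which is why a single function cannot serve both levels.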

