Posted to reviews@spark.apache.org by aray <gi...@git.apache.org> on 2017/01/13 15:23:40 UTC

[GitHub] spark pull request #16577: [SPARK-19214][SQL] Typed aggregate count output f...

GitHub user aray opened a pull request:

    https://github.com/apache/spark/pull/16577

    [SPARK-19214][SQL] Typed aggregate count output field name should be "count"

    ## What changes were proposed in this pull request?
    
    Changes the output field name of typed aggregate counts to `count` (instead of `count(1)`) for consistency with the DataFrame API.
    
    ## How was this patch tested?
    
    unit test

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/aray/spark typed-count-name

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/16577.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #16577
    
----
commit 37319fe312b37f67078e41e3621171893aa20a92
Author: Andrew Ray <ra...@gmail.com>
Date:   2017-01-13T15:20:25Z

    typed aggregate count output field name should be "count"

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    **[Test build #71330 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71330/testReport)** for PR 16577 at commit [`37319fe`](https://github.com/apache/spark/commit/37319fe312b37f67078e41e3621171893aa20a92).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


[GitHub] spark pull request #16577: [SPARK-19214][SQL] Typed aggregate count output f...

Posted by aray <gi...@git.apache.org>.
Github user aray closed the pull request at:

    https://github.com/apache/spark/pull/16577


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    (This rings a bell for me. It seems to be okay to break the default names if that is required - https://github.com/apache/spark/pull/14444#issuecomment-246932928)


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    Is the problem just the default field name? Hm, the concern is that this changes the output and could break code. I am not sure these are necessarily meant to be consistent. I haven't looked at the behavior of other DataFrame/Dataset methods, though I expect _those_ to be consistent. At best it may have to be a known issue, and of course anyone should name fields explicitly if they depend on the name.


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    **[Test build #71330 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71330/testReport)** for PR 16577 at commit [`37319fe`](https://github.com/apache/spark/commit/37319fe312b37f67078e41e3621171893aa20a92).


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71330/
    Test PASSed.


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    Regarding the default names of expressions, this is not the only issue if we want to make them more readable. Maybe we can close this for now and revisit the whole issue later?


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    @aray WDYT on ^?


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    Merged build finished. Test PASSed.


[GitHub] spark issue #16577: [SPARK-19214][SQL] Typed aggregate count output field na...

Posted by aray <gi...@git.apache.org>.
Github user aray commented on the issue:

    https://github.com/apache/spark/pull/16577
  
    Closing since it does not look like there is any interest in changing this. Thanks everyone!

