You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by dilipbiswal <gi...@git.apache.org> on 2016/05/11 06:09:29 UTC

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

GitHub user dilipbiswal opened a pull request:

    https://github.com/apache/spark/pull/13045

    [SPARK-15114][SQL] Column name generated by typed aggregate is super verbose

    ## What changes were proposed in this pull request?
    
    Generate a shorter default alias for `AggregateExpression `, In this PR, aggregate function name along with a index is used for generating the alias name.
    
    ```SQL
    val ds = Seq(1, 3, 2, 5).toDS()
    ds.select(typed.sum((i: Int) => i), typed.avg((i: Int) => i)).show()
    ```
    
    Output before change.
    ```SQL
    +-----------------------------------------------------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------------+
    |typedsumdouble(unresolveddeserializer(upcast(input[0, int], IntegerType, - root class: "scala.Int"), value#1), upcast(value))|typedaverage(unresolveddeserializer(upcast(input[0, int], IntegerType, - root class: "scala.Int"), value#1), newInstance(class scala.Tuple2))|
    +-----------------------------------------------------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------------+
    |                                                                                                                         11.0|                                                                                                                                         2.75|
    +-----------------------------------------------------------------------------------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------------+
    ```
    Output after change:
    ```SQL
    +-----------------+---------------+
    |typedsumdouble_c1|typedaverage_c2|
    +-----------------+---------------+
    |             11.0|           2.75|
    +-----------------+---------------+
    ```
    
    Note: There is one test in ParquetSuites.scala which shows that that the system picked alias
    name is not usable and is rejected.  [test](https://github.com/apache/spark/blob/master/sql/hive/src/test/scala/org/apache/spark/sql/hive/parquetSuites.scala#L672-#L687)
    ## How was this patch tested?
    
    A new test was added in DataSetAggregatorSuite. 


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/dilipbiswal/spark spark-15114

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/13045.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #13045
    
----
commit 65375b41605ebe8d506a11e255ee5b1efa54e36d
Author: Dilip Biswal <db...@us.ibm.com>
Date:   2016-05-10T23:33:30Z

    [SPARK-15114] Column name generated by typed aggregate is super verbose

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218581069
  
    cc @yhuai @cloud-fan @gatorsmile 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63992015
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Column.scala ---
    @@ -37,6 +38,14 @@ private[sql] object Column {
       def apply(expr: Expression): Column = new Column(expr)
     
       def unapply(col: Column): Option[Expression] = Some(col.expr)
    +
    +  private[sql] def generateAlias(e: Expression, index: Int): String = {
    +    e match {
    +      case a: AggregateExpression if a.aggregateFunction.isInstanceOf[TypedAggregateExpression] =>
    +        s"${a.aggregateFunction.prettyName}_c${index}"
    --- End diff --
    
    ok.. let me get the output for you and paste it here so its easier to decide. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r62968394
  
    --- Diff: python/pyspark/sql/dataframe.py ---
    @@ -861,11 +861,11 @@ def groupBy(self, *cols):
                 Each element should be a column name (string) or an expression (:class:`Column`).
     
             >>> df.groupBy().avg().collect()
    -        [Row(avg(age)=3.5)]
    +        [Row(avg_c1=3.5)]
    --- End diff --
    
    @cloud-fan I see. Thank you for clarifying :-)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63996737
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Column.scala ---
    @@ -37,6 +38,14 @@ private[sql] object Column {
       def apply(expr: Expression): Column = new Column(expr)
     
       def unapply(col: Column): Option[Expression] = Some(col.expr)
    +
    +  private[sql] def generateAlias(e: Expression, index: Int): String = {
    +    e match {
    +      case a: AggregateExpression if a.aggregateFunction.isInstanceOf[TypedAggregateExpression] =>
    +        s"${a.aggregateFunction.prettyName}_c${index}"
    --- End diff --
    
    SGTM


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218446117
  
    **[Test build #58365 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58365/consoleFull)** for PR 13045 at commit [`256f9e9`](https://github.com/apache/spark/commit/256f9e98c72b0368afbd95f9e953414447461ee1).
     * This patch **fails PySpark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220242851
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r62965291
  
    --- Diff: python/pyspark/sql/dataframe.py ---
    @@ -861,11 +861,11 @@ def groupBy(self, *cols):
                 Each element should be a column name (string) or an expression (:class:`Column`).
     
             >>> df.groupBy().avg().collect()
    -        [Row(avg(age)=3.5)]
    +        [Row(avg_c1=3.5)]
    --- End diff --
    
    @cloud-fan Very sorry..  here is the [test](https://github.com/apache/spark/blob/master/sql/hive/src/test/scala/org/apache/spark/sql/hive/parquetSuites.scala#L706-#L719)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220242701
  
    **[Test build #58850 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58850/consoleFull)** for PR 13045 at commit [`9eb6f40`](https://github.com/apache/spark/commit/9eb6f4063adaf7cda79cdf0bf2ac11414ca5c1d2).
     * This patch **fails PySpark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63994025
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Column.scala ---
    @@ -37,6 +38,14 @@ private[sql] object Column {
       def apply(expr: Expression): Column = new Column(expr)
     
       def unapply(col: Column): Option[Expression] = Some(col.expr)
    +
    +  private[sql] def generateAlias(e: Expression, index: Int): String = {
    +    e match {
    +      case a: AggregateExpression if a.aggregateFunction.isInstanceOf[TypedAggregateExpression] =>
    +        s"${a.aggregateFunction.prettyName}_c${index}"
    --- End diff --
    
    @cloud-fan Looks like following. Lets go with this ? I will drop the index parameter.
    
    ```SQL
    +-------------------+-------------------+
    |TypedSumDouble(int)|TypedSumDouble(int)|
    +-------------------+-------------------+
    |               11.0|               11.0|
    +-------------------+-------------------+
    ```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218930711
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58527/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220121879
  
    **[Test build #58809 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58809/consoleFull)** for PR 13045 at commit [`c2f651d`](https://github.com/apache/spark/commit/c2f651da7495d2fe6e0dbe37e5a303cb0c91affc).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r62964226
  
    --- Diff: python/pyspark/sql/dataframe.py ---
    @@ -861,11 +861,11 @@ def groupBy(self, *cols):
                 Each element should be a column name (string) or an expression (:class:`Column`).
     
             >>> df.groupBy().avg().collect()
    -        [Row(avg(age)=3.5)]
    +        [Row(avg_c1=3.5)]
    --- End diff --
    
    @cloud-fan Actually, initially i had it scoped only to wrap the expression with UnresolvedAlias for TypedAggregateExpression. I then stumbled up on this  [test](https://github.com/apache/spark/blob/master/sql/hive/src/test/scala/org/apache/spark/sql/hive/parquetSuites.scala#L672-#L687). 
    
    Wenchen, what do you think about the above test ? Basically this prompted me to widen the scope to generate simpler and shorter (i agree, its not better) for AggregateExpressions instead of limiting it to typed aggregates. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220433675
  
    **[Test build #58896 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58896/consoleFull)** for PR 13045 at commit [`689c513`](https://github.com/apache/spark/commit/689c51311d31765c715edc94e4ac28e798c761c5).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220562032
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58970/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63991063
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala ---
    @@ -174,14 +174,16 @@ class Analyzer(
         private def assignAliases(exprs: Seq[NamedExpression]) = {
           exprs.zipWithIndex.map {
             case (expr, i) =>
    -          expr.transformUp { case u @ UnresolvedAlias(child, optionalAliasName) =>
    +          expr.transformUp { case u @ UnresolvedAlias(child, optGenAliasFunc) =>
                 child match {
                   case ne: NamedExpression => ne
                   case e if !e.resolved => u
                   case g: Generator => MultiAlias(g, Nil)
                   case c @ Cast(ne: NamedExpression, _) => Alias(c, ne.name)()
                   case e: ExtractValue => Alias(e, toPrettySQL(e))()
    -              case e => Alias(e, optionalAliasName.getOrElse(toPrettySQL(e)))()
    +              case e if optGenAliasFunc.isDefined =>
    +                Alias(child, s"${optGenAliasFunc.get.apply(e, i + 1)}")()
    --- End diff --
    
    nit: we can just use `optGenAliasFunc.get.apply(e, i + 1)`, no need to wrap it with `s"${}"` ...


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by yhuai <gi...@git.apache.org>.

Github user yhuai commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218661443
  
    How about we just improve the column name of fields generated by typed aggregation. Seems we are also changing other cases?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218381999
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58334/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218931189
  
    @cloud-fan Hi Wenchen, can you please look over the change and let me know what you think ? I had a question for you. I tried to keep the expression un-resolved and resolve it in Analyzer but since TypedAggregateExpression is in core, i can't seem to access it from analysis package. So i am not generating an resolved expression in Column.named. Let me know what you think..


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220102264
  
    **[Test build #58798 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58798/consoleFull)** for PR 13045 at commit [`0448fa5`](https://github.com/apache/spark/commit/0448fa5e4c2bae033d272f9ffb8716614fcd1adc).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218424765
  
    **[Test build #58357 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58357/consoleFull)** for PR 13045 at commit [`660af50`](https://github.com/apache/spark/commit/660af5064cd93d1860be8f100aad227351c06f96).
     * This patch **fails PySpark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218424994
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58357/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220544365
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218406705
  
    **[Test build #58357 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58357/consoleFull)** for PR 13045 at commit [`660af50`](https://github.com/apache/spark/commit/660af5064cd93d1860be8f100aad227351c06f96).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218446257
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58365/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220147299
  
    **[Test build #58809 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58809/consoleFull)** for PR 13045 at commit [`c2f651d`](https://github.com/apache/spark/commit/c2f651da7495d2fe6e0dbe37e5a303cb0c91affc).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218575357
  
    **[Test build #58393 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58393/consoleFull)** for PR 13045 at commit [`79ad9fc`](https://github.com/apache/spark/commit/79ad9fccb289580f0a5675d7db52b485aefd86b7).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218381873
  
    **[Test build #58334 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58334/consoleFull)** for PR 13045 at commit [`65375b4`](https://github.com/apache/spark/commit/65375b41605ebe8d506a11e255ee5b1efa54e36d).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220784325
  
    thanks, merging to master and 2.0!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218930608
  
    **[Test build #58527 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58527/consoleFull)** for PR 13045 at commit [`f92e393`](https://github.com/apache/spark/commit/f92e3934b7e098f86c918dc229d8da1c12f95189).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220561880
  
    **[Test build #58970 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58970/consoleFull)** for PR 13045 at commit [`1a83497`](https://github.com/apache/spark/commit/1a83497be1d294b200943a6c1aa077a2b2b9dd46).
     * This patch **fails PySpark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63991309
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Column.scala ---
    @@ -37,6 +38,14 @@ private[sql] object Column {
       def apply(expr: Expression): Column = new Column(expr)
     
       def unapply(col: Column): Option[Expression] = Some(col.expr)
    +
    +  private[sql] def generateAlias(e: Expression, index: Int): String = {
    +    e match {
    +      case a: AggregateExpression if a.aggregateFunction.isInstanceOf[TypedAggregateExpression] =>
    +        s"${a.aggregateFunction.prettyName}_c${index}"
    --- End diff --
    
    how about `aggregateFunction.toString`? It carries more information and not that verbose.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r62959660
  
    --- Diff: python/pyspark/sql/dataframe.py ---
    @@ -861,11 +861,11 @@ def groupBy(self, *cols):
                 Each element should be a column name (string) or an expression (:class:`Column`).
     
             >>> df.groupBy().avg().collect()
    -        [Row(avg(age)=3.5)]
    +        [Row(avg_c1=3.5)]
    --- End diff --
    
    I don't think `avg_c1` is better than `avg(age)`. I agree the sql string of typed aggregate is too verbose, but I prefer to only simplify it, but not all aggregegate expressions.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218920807
  
    **[Test build #58527 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58527/consoleFull)** for PR 13045 at commit [`f92e393`](https://github.com/apache/spark/commit/f92e3934b7e098f86c918dc229d8da1c12f95189).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63991134
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/unresolved.scala ---
    @@ -325,10 +325,13 @@ case class UnresolvedExtractValue(child: Expression, extraction: Expression)
      * Holds the expression that has yet to be aliased.
      *
      * @param child The computation that is needs to be resolved during analysis.
    - * @param aliasName The name if specified to be associated with the result of computing [[child]]
    + * @param aliasFunc The function if specified to be called to generate an alias to associate
    --- End diff --
    
    we need to say more about the 2 parameters this `aliasFunc` takes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218575667
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58393/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r62966720
  
    --- Diff: python/pyspark/sql/dataframe.py ---
    @@ -861,11 +861,11 @@ def groupBy(self, *cols):
                 Each element should be a column name (string) or an expression (:class:`Column`).
     
             >>> df.groupBy().avg().collect()
    -        [Row(avg(age)=3.5)]
    +        [Row(avg_c1=3.5)]
    --- End diff --
    
    @cloud-fan 
    val df = Seq(1, 2, 3).map(i => (i, i.toString)).toDF("int", "str")
    val df2 = df.as('x).join(df.as('y), $"x.str" === $"y.str").groupBy("y.str").max("y.int")
    intercept[Throwable](df2.write.parquet(filePath))
    
    I am reading it as though, we don't like the alias names that is generated by us (system picked) ? Shouldn't this write go through without any problem ? When i changed the alias name to a simpler one like max_c0 instead of max(c0) , this test started failing and thats how i came to know about it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220515698
  
    cc @cloud-fan Hi Wenchen, I have made the changes per your comments. Could you please look through it when you get a chance ? Thanks..


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220128719
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58798/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by rxin <gi...@git.apache.org>.

Github user rxin commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220763498
  
    cc @cloud-fan 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218446255
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220197500
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58822/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220456786
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58896/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r62966223
  
    --- Diff: python/pyspark/sql/dataframe.py ---
    @@ -861,11 +861,11 @@ def groupBy(self, *cols):
                 Each element should be a column name (string) or an expression (:class:`Column`).
     
             >>> df.groupBy().avg().collect()
    -        [Row(avg(age)=3.5)]
    +        [Row(avg_c1=3.5)]
    --- End diff --
    
    What's the problem with it? Expecting to see special chars in aggregation attribute but it doesn't?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218381997
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220530378
  
    **[Test build #58957 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58957/consoleFull)** for PR 13045 at commit [`f410999`](https://github.com/apache/spark/commit/f410999f4071e82b89836b64ab4cdee419edded2).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by asfgit <gi...@git.apache.org>.

Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/13045


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220456509
  
    **[Test build #58896 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58896/consoleFull)** for PR 13045 at commit [`689c513`](https://github.com/apache/spark/commit/689c51311d31765c715edc94e4ac28e798c761c5).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220544204
  
    **[Test build #58957 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58957/consoleFull)** for PR 13045 at commit [`f410999`](https://github.com/apache/spark/commit/f410999f4071e82b89836b64ab4cdee419edded2).
     * This patch **fails PySpark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220147651
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220562029
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220147655
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58809/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63809763
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/unresolved.scala ---
    @@ -326,9 +326,14 @@ case class UnresolvedExtractValue(child: Expression, extraction: Expression)
      *
      * @param child The computation that is needs to be resolved during analysis.
      * @param aliasName The name if specified to be associated with the result of computing [[child]]
    + * @param aliasFunc The function if specified to be called to generate an alias to associate
    + *                  with the result of computing [[child]]
      *
      */
    -case class UnresolvedAlias(child: Expression, aliasName: Option[String] = None)
    +case class UnresolvedAlias(
    +    child: Expression,
    +    aliasName: Option[String] = None,
    +    aliasFunc: Option[Expression => String] = None)
    --- End diff --
    
    I think `aliasFunc` is enough. Ideally `aliasName` is also a function that ignore the parameter and return a string.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220654844
  
    **[Test build #58999 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58999/consoleFull)** for PR 13045 at commit [`04a2fd9`](https://github.com/apache/spark/commit/04a2fd96033485d0a2d0396bdc0e1cbdd9b777b7).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63991215
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetAggregatorSuite.scala ---
    @@ -240,4 +240,15 @@ class DatasetAggregatorSuite extends QueryTest with SharedSQLContext {
         val df2 = Seq(1 -> "a", 2 -> "b", 3 -> "b").toDF("i", "j")
         checkAnswer(df2.agg(RowAgg.toColumn as "b").select("b"), Row(6) :: Nil)
       }
    +
    +  test("spark-15114 shorter system generated alias names") {
    +    val ds = Seq(1, 3, 2, 5).toDS()
    +    assert(ds.select(typed.sum((i: Int) => i)).columns.head === "typedsumdouble_c1")
    +    val ds2 = ds.select(typed.sum((i: Int) => i), typed.avg((i: Int) => i))
    +    assert(ds2.columns.head === "typedsumdouble_c1")
    --- End diff --
    
    I'm not sure how useful this `_c1` postfix is, maybe we can remove it and simplify the `aliasFunc`?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218548936
  
    **[Test build #58393 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58393/consoleFull)** for PR 13045 at commit [`79ad9fc`](https://github.com/apache/spark/commit/79ad9fccb289580f0a5675d7db52b485aefd86b7).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218930710
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220233152
  
    **[Test build #58850 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58850/consoleFull)** for PR 13045 at commit [`9eb6f40`](https://github.com/apache/spark/commit/9eb6f4063adaf7cda79cdf0bf2ac11414ca5c1d2).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220184467
  
    **[Test build #58822 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58822/consoleFull)** for PR 13045 at commit [`21c58bc`](https://github.com/apache/spark/commit/21c58bc2525b7863a367c66050ef9a6f90962da2).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220633250
  
    **[Test build #58999 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58999/consoleFull)** for PR 13045 at commit [`04a2fd9`](https://github.com/apache/spark/commit/04a2fd96033485d0a2d0396bdc0e1cbdd9b777b7).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218429665
  
    **[Test build #58365 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58365/consoleFull)** for PR 13045 at commit [`256f9e9`](https://github.com/apache/spark/commit/256f9e98c72b0368afbd95f9e953414447461ee1).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218370929
  
    **[Test build #58334 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58334/consoleFull)** for PR 13045 at commit [`65375b4`](https://github.com/apache/spark/commit/65375b41605ebe8d506a11e255ee5b1efa54e36d).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220242854
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58850/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r62967194
  
    --- Diff: python/pyspark/sql/dataframe.py ---
    @@ -861,11 +861,11 @@ def groupBy(self, *cols):
                 Each element should be a column name (string) or an expression (:class:`Column`).
     
             >>> df.groupBy().avg().collect()
    -        [Row(avg(age)=3.5)]
    +        [Row(avg_c1=3.5)]
    --- End diff --
    
    system generated alias name is just for display, if we are going to write data out, users should manually alias it with a more formal name.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218575662
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r62965026
  
    --- Diff: python/pyspark/sql/dataframe.py ---
    @@ -861,11 +861,11 @@ def groupBy(self, *cols):
                 Each element should be a column name (string) or an expression (:class:`Column`).
     
             >>> df.groupBy().avg().collect()
    -        [Row(avg(age)=3.5)]
    +        [Row(avg_c1=3.5)]
    --- End diff --
    
    looks like you link to the wrong test, which test are you talking about?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220655113
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220456781
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218662623
  
    @yhuai @cloud-fan Sure. I will change it only for typed aggregation.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220197302
  
    **[Test build #58822 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58822/consoleFull)** for PR 13045 at commit [`21c58bc`](https://github.com/apache/spark/commit/21c58bc2525b7863a367c66050ef9a6f90962da2).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by cloud-fan <gi...@git.apache.org>.

Github user cloud-fan commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-219670131
  
    I don't have a good idea either, one possible approach is improving `UnresolvedAlias` to take a function as the optional alias name, and the function will take the resolved expression and generate alias name.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220197499
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220128712
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220544367
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58957/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220655117
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/58999/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220546138
  
    **[Test build #58970 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58970/consoleFull)** for PR 13045 at commit [`1a83497`](https://github.com/apache/spark/commit/1a83497be1d294b200943a6c1aa077a2b2b9dd46).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by SparkQA <gi...@git.apache.org>.

Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-220128385
  
    **[Test build #58798 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/58798/consoleFull)** for PR 13045 at commit [`0448fa5`](https://github.com/apache/spark/commit/0448fa5e4c2bae033d272f9ffb8716614fcd1adc).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds the following public classes _(experimental)_:
      * `case class UnresolvedAlias(`


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by AmplabJenkins <gi...@git.apache.org>.

Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/13045#issuecomment-218424992
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-15114][SQL] Column name generated by ty...

Posted by dilipbiswal <gi...@git.apache.org>.

Github user dilipbiswal commented on a diff in the pull request:

    https://github.com/apache/spark/pull/13045#discussion_r63991554
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetAggregatorSuite.scala ---
    @@ -240,4 +240,15 @@ class DatasetAggregatorSuite extends QueryTest with SharedSQLContext {
         val df2 = Seq(1 -> "a", 2 -> "b", 3 -> "b").toDF("i", "j")
         checkAnswer(df2.agg(RowAgg.toColumn as "b").select("b"), Row(6) :: Nil)
       }
    +
    +  test("spark-15114 shorter system generated alias names") {
    +    val ds = Seq(1, 3, 2, 5).toDS()
    +    assert(ds.select(typed.sum((i: Int) => i)).columns.head === "typedsumdouble_c1")
    +    val ds2 = ds.select(typed.sum((i: Int) => i), typed.avg((i: Int) => i))
    +    assert(ds2.columns.head === "typedsumdouble_c1")
    --- End diff --
    
    @cloud-fan Just wanted to show some difference to user between two aggregate expressions like sum(col1), sum(col2) will show up as typedsumdouble_c1 and typedsumdouble_c2. You think its fine to just report without any suffix ? If you think its ok, then may be we can just create resolved Aliases in Column.named as opposed to deferring it to Analyzer ? Please let me know.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org