
Posted to issues@spark.apache.org by "Apache Spark (Jira)" <ji...@apache.org> on 2022/12/20 23:04:00 UTC

[jira] [Assigned] (SPARK-41635) GROUP BY ALL

     [ https://issues.apache.org/jira/browse/SPARK-41635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-41635:
------------------------------------

    Assignee: Apache Spark  (was: Reynold Xin)

> GROUP BY ALL
> ------------
>
>                 Key: SPARK-41635
>                 URL: https://issues.apache.org/jira/browse/SPARK-41635
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 3.3.0
>            Reporter: Reynold Xin
>            Assignee: Apache Spark
>            Priority: Major
>
> This patch implements GROUP BY ALL, similar to the feature first implemented in DuckDB. When it is specified, the analyzer infers the grouping columns from the expressions in the select clause: every expression that contains no aggregate expression is implicitly added to the grouping columns. In most cases this saves users from listing the grouping columns individually.
> Example:
> {{select key, count(*), sum(score) from table group by all}}
> {{-- rewritten to}}
> {{select key, count(*), sum(score) from table group by key}}
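
The inference rule described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the Spark analyzer code: the real rule works on resolved expression trees, while this sketch matches function names in plain strings. The names AGGREGATE_FUNCTIONS, contains_aggregate, infer_grouping_columns, and rewrite_group_by_all are hypothetical, and the list of aggregate functions is deliberately incomplete.

```python
import re

# Hypothetical, non-exhaustive set of aggregate function names (assumption for this sketch).
AGGREGATE_FUNCTIONS = {"count", "sum", "avg", "min", "max"}


def contains_aggregate(expr):
    """Return True if the expression text calls a known aggregate function."""
    # Collect every identifier that is immediately followed by "(".
    calls = re.findall(r"([A-Za-z_][A-Za-z0-9_]*)\s*\(", expr)
    return any(name.lower() in AGGREGATE_FUNCTIONS for name in calls)


def infer_grouping_columns(select_exprs):
    """Keep, in select-list order, every expression with no aggregate call."""
    return [e for e in select_exprs if not contains_aggregate(e)]


def rewrite_group_by_all(select_exprs, table):
    """Build the query text with GROUP BY ALL expanded to explicit columns."""
    grouping = infer_grouping_columns(select_exprs)
    query = "select " + ", ".join(select_exprs) + " from " + table
    # Assumption: if every select expression is an aggregate, no GROUP BY
    # clause is emitted and the query is treated as a global aggregate.
    if grouping:
        query += " group by " + ", ".join(grouping)
    return query


if __name__ == "__main__":
    print(rewrite_group_by_all(["key", "count(*)", "sum(score)"], "table"))
```

Run on the example from the description, the sketch produces the rewritten query with key as the only grouping column. A non-aggregate function call such as upper(name) would also be pulled into the grouping columns, while sum(score) + 1 would not, because it contains an aggregate.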



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
