Posted to issues@spark.apache.org by "Reynold Xin (Jira)" <ji...@apache.org> on 2022/12/20 23:02:00 UTC

[jira] [Created] (SPARK-41635) GROUP BY ALL

Reynold Xin created SPARK-41635:
-----------------------------------

             Summary: GROUP BY ALL
                 Key: SPARK-41635
                 URL: https://issues.apache.org/jira/browse/SPARK-41635
             Project: Spark
          Issue Type: New Feature
          Components: SQL
    Affects Versions: 3.3.0
            Reporter: Reynold Xin
            Assignee: Reynold Xin


This patch implements GROUP BY ALL, similar to the one initially implemented in DuckDB. When specified, the analyzer automatically infers the grouping columns from the expressions in the select clause: every select expression that contains no aggregate expression is implicitly added to the grouping columns. In most cases, this saves users from having to list the grouping columns individually.

Example:

{{select key, count(*), sum(score) from table group by all}}
{{-- rewritten to}}
{{select key, count(*), sum(score) from table group by key}}
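
The inference rule described above can be illustrated with a small toy model in Python. This is only a sketch under stated assumptions, not the analyzer code from the patch: the Column and Call classes, the is_aggregate flag, and the infer_group_by_all function are all hypothetical names made up for this illustration. It also does not model how Spark treats a select expression that mixes a plain column with an aggregate (for example key + sum(score)); here such an expression is simply left out of the grouping list.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union


@dataclass(frozen=True)
class Column:
    """A plain column reference (hypothetical toy type)."""
    name: str


@dataclass(frozen=True)
class Call:
    """A function call; is_aggregate marks count, sum, etc. (hypothetical toy type)."""
    func: str
    args: Tuple["Expr", ...]
    is_aggregate: bool = False


Expr = Union[Column, Call]


def contains_aggregate(expr: Expr) -> bool:
    """Return True if expr is, or contains anywhere inside it, an aggregate call."""
    if isinstance(expr, Column):
        return False
    return expr.is_aggregate or any(contains_aggregate(a) for a in expr.args)


def infer_group_by_all(select_list: List[Expr]) -> List[Expr]:
    """Keep every select expression that contains no aggregate expression."""
    return [e for e in select_list if not contains_aggregate(e)]


# select key, count(*), sum(score) from table group by all
select_list = [
    Column("key"),
    Call("count", (Column("*"),), is_aggregate=True),
    Call("sum", (Column("score"),), is_aggregate=True),
]
grouping = infer_group_by_all(select_list)
print(grouping)  # only the non-aggregate expression, key, remains
```

For the example query, only key survives the filter, which matches the rewritten group by key clause shown above.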



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
