You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Pablo Langa Blanco (Jira)" <ji...@apache.org> on 2021/04/05 23:18:00 UTC

[jira] [Created] (SPARK-34961) Migrate First function from DeclarativeAggregate to TypedImperativeAggregate to improve performance

Pablo Langa Blanco created SPARK-34961:
------------------------------------------

             Summary: Migrate First function from DeclarativeAggregate to TypedImperativeAggregate to improve performance
                 Key: SPARK-34961
                 URL: https://issues.apache.org/jira/browse/SPARK-34961
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.2.0
            Reporter: Pablo Langa Blanco


The main objective of this change is to improve performance in some cases.

We have three possibilities when we plan an aggregation. In the first case, with mutable primitive types, HashAggregate is used.

When we are not using these types we have two options. If the function implements TypedImperativeAggregate we use ObjectHashAggregate. Otherwise, we use SortAggregate that is less efficient.

In this PR I propose to migrate First function to implement TypedImperativeAggregate to take advantage of this feature (ObjectAggregateExec)

This Jira is related to SPARK-34464

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org