You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2018/08/27 07:48:00 UTC
[jira] [Assigned] (SPARK-24978) Add
spark.sql.fast.hash.aggregate.row.max.capacity to configure the capacity of
fast aggregation.
[ https://issues.apache.org/jira/browse/SPARK-24978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wenchen Fan reassigned SPARK-24978:
-----------------------------------
Assignee: caoxuewen
> Add spark.sql.fast.hash.aggregate.row.max.capacity to configure the capacity of fast aggregation.
> -------------------------------------------------------------------------------------------------
>
> Key: SPARK-24978
> URL: https://issues.apache.org/jira/browse/SPARK-24978
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 2.4.0, 3.0.0
> Reporter: caoxuewen
> Assignee: caoxuewen
> Priority: Major
>
> this pr add a configuration parameter to configure the capacity of fast aggregation.
> Performance comparison:
> /*
> Java HotSpot(TM) 64-Bit Server VM 1.8.0_60-b27 on Windows 7 6.1
> Intel64 Family 6 Model 94 Stepping 3, GenuineIntel
> Aggregate w multiple keys: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
> ------------------------------------------------------------------------------------------------
> fasthash = default 5612 / 5882 3.7 267.6 1.0X
> fasthash = config 3586 / 3595 5.8 171.0 1.6X
> */
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org