You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Yunhong Zheng (Jira)" <ji...@apache.org> on 2023/02/09 03:56:00 UTC
[jira] [Created] (FLINK-30971) Modify the default value of parameter 'table.exec.local-hash-agg.adaptive.sampling-threshold'
Yunhong Zheng created FLINK-30971:
-------------------------------------
Summary: Modify the default value of parameter 'table.exec.local-hash-agg.adaptive.sampling-threshold'
Key: FLINK-30971
URL: https://issues.apache.org/jira/browse/FLINK-30971
Project: Flink
Issue Type: Bug
Components: Table SQL / Runtime
Affects Versions: 1.17.0
Reporter: Yunhong Zheng
Fix For: 1.17.0
In our test environment, we set the default parallelism to 1 and got the most appropriate default value of parameter 'table.exec.local-hash-agg.adaptive.sampling-threshold' is 5000000. However, for these batch jobs with high parallelism in produce environment, the amount of data in single parallelism is almost less than 5000000. Therefore, after testing, we found that set to 500000 can get better results.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)