You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "CanBin Zheng (JIRA)" <ji...@apache.org> on 2017/06/01 02:06:04 UTC
[jira] [Created] (SPARK-20943) Correct
BypassMergeSortShuffleWriter's comment
CanBin Zheng created SPARK-20943:
------------------------------------
Summary: Correct BypassMergeSortShuffleWriter's comment
Key: SPARK-20943
URL: https://issues.apache.org/jira/browse/SPARK-20943
Project: Spark
Issue Type: Improvement
Components: Shuffle, Spark Core
Affects Versions: 2.1.1
Reporter: CanBin Zheng
There are some comments written in BypassMergeSortShuffleWriter.java about when to select this write path, the three required conditions are described as follows:
1. no Ordering is specified, and
2. no Aggregator is specified, and
3. the number of partitions is less than
spark.shuffle.sort.bypassMergeThreshold
Obviously, the conditions written are partially wrong and misleading, the right conditions should be:
1. map-side combine is false, and
2. the number of partitions is less than
spark.shuffle.sort.bypassMergeThreshold
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org