You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spark.apache.org by do...@apache.org on 2020/04/18 16:29:17 UTC

[spark] branch branch-3.0 updated: [SPARK-31442][SQL] Print shuffle id at coalesce partitions target size

This is an automated email from the ASF dual-hosted git repository.

dongjoon pushed a commit to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/branch-3.0 by this push:
     new 842fbed  [SPARK-31442][SQL] Print shuffle id at coalesce partitions target size
842fbed is described below

commit 842fbed1d43f2ce5224d1367a7891f636243382e
Author: ulysses <yo...@weidian.com>
AuthorDate: Sat Apr 18 09:27:44 2020 -0700

    [SPARK-31442][SQL] Print shuffle id at coalesce partitions target size
    
    ### What changes were proposed in this pull request?
    
    Minor change. Print shuffle id.
    
    ### Why are the changes needed?
    
    Make log more clear.
    
    ### Does this PR introduce any user-facing change?
    
    No.
    
    ### How was this patch tested?
    
    Not need.
    
    Closes #28211 from ulysses-you/print-shuffle-id.
    
    Authored-by: ulysses <yo...@weidian.com>
    Signed-off-by: Dongjoon Hyun <do...@apache.org>
    (cherry picked from commit 6c2bf8248a439951fe5157b34ad269cb31df0e16)
    Signed-off-by: Dongjoon Hyun <do...@apache.org>
---
 .../apache/spark/sql/execution/adaptive/ShufflePartitionsUtil.scala   | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/ShufflePartitionsUtil.scala b/sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/ShufflePartitionsUtil.scala
index 208cc05..e10ed4f 100644
--- a/sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/ShufflePartitionsUtil.scala
+++ b/sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/ShufflePartitionsUtil.scala
@@ -68,7 +68,9 @@ object ShufflePartitionsUtil extends Logging {
       math.ceil(totalPostShuffleInputSize / minNumPartitions.toDouble).toLong, 16)
     val targetSize = math.min(maxTargetSize, advisoryTargetSize)
 
-    logInfo(s"advisory target size: $advisoryTargetSize, actual target size $targetSize.")
+    val shuffleIds = mapOutputStatistics.map(_.shuffleId).mkString(", ")
+    logInfo(s"For shuffle($shuffleIds), advisory target size: $advisoryTargetSize, " +
+      s"actual target size $targetSize.")
 
     // Make sure these shuffles have the same number of partitions.
     val distinctNumShufflePartitions =


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@spark.apache.org
For additional commands, e-mail: commits-help@spark.apache.org