You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/10/07 12:52:02 UTC

[GitHub] [spark] cloud-fan commented on a change in pull request #26040: [SPARK-9853][Core] Optimize shuffle fetch of continuous partition IDs

cloud-fan commented on a change in pull request #26040: [SPARK-9853][Core] Optimize shuffle fetch of continuous partition IDs
URL: https://github.com/apache/spark/pull/26040#discussion_r331999858
 
 

 ##########
 File path: common/network-shuffle/src/main/java/org/apache/spark/network/shuffle/protocol/FetchShuffleBlocks.java
 ##########
 @@ -36,19 +36,24 @@
   // it corresponds to the i-th int[] in reduceIds, which contains all reduce id for this map id.
   public final long[] mapIds;
   public final int[][] reduceIds;
+  // Indicates whether read continuous shuffle blocks in batch for IO reducing. When it is true,
+  // the reduceIds contains the range [startReduceId, endReduceId) for each ShuffleBlockBatchId.
 
 Review comment:
   I think it's clearer to document `reduceIds` instead of `batchFetchEnabled`. For example,
   ```
   when batchFetchEnabled=true, reduceIds[i] contains 2 elements: startReduceId (inclusive) and
   endReduceId (exclusive) for the mapper mapIds[i].
   when batchFetchEnabled=false, reduceIds[i] contains all the reduce IDs that mapper mapIds[i] needs to fetch.
   ```

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org