You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@uniffle.apache.org by GitBox <gi...@apache.org> on 2022/07/14 14:46:13 UTC

[GitHub] [incubator-uniffle] jerqi commented on pull request #55: [Feature][MR] Support remote spill

jerqi commented on PR #55:
URL: https://github.com/apache/incubator-uniffle/pull/55#issuecomment-1184538911

   > ### What changes were proposed in this pull request?
   > Rewrite Mapreduce's MergerManager to spill sorted segments to HDFS, It returns a merge-sorted iterator to read these HDFS segments.
   > 
   > ### Why are the changes needed?
   > In cloud, machines may have very limited disk space and performance. This PR allows to spill data to remote storage (e.g., hdfs)
   > 
   > ### Does this PR introduce _any_ user-facing change?
   > Yes. rss.reduce.remote.spill.enable (default false)
   > 
   > ### How was this patch tested?
   > New UT and IT with remote spill.
   > 
   > Co-authored-by: roryqi [roryqi@tencent.com](mailto:roryqi@tencent.com)
   
   Because this pr will introduce user-facing change. We should update doc.
   And we should supply the performance test results.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@uniffle.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@uniffle.apache.org
For additional commands, e-mail: issues-help@uniffle.apache.org