Posted to common-dev@hadoop.apache.org by "Arun C Murthy (JIRA)" <ji...@apache.org> on 2009/05/19 09:18:45 UTC
[jira] Created: (HADOOP-5870) Implement a memory-to-memory sort in
the map task
Implement a memory-to-memory sort in the map task
-------------------------------------------------
Key: HADOOP-5870
URL: https://issues.apache.org/jira/browse/HADOOP-5870
Project: Hadoop Core
Issue Type: Improvement
Components: mapred
Reporter: Arun C Murthy
The motivation is similar to HADOOP-5831...
Currently we collect map outputs in the sort buffer (sized by io.sort.mb), which we eventually sort and spill to disk. For latency-sensitive applications with sufficient memory, e.g. terasort, we could do better by sorting memory-to-memory and then doing a single final memory-to-disk merge.
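
The idea above can be illustrated with a minimal sketch. It is not Hadoop's MapTask code: the class and method names (MemToMemSortSketch, sortInMemory, finalMerge) are hypothetical, map outputs are modelled as plain strings, and the final merge returns a list where a real map task would stream records to its single output file. It only assumes that each buffered segment is sorted in memory without being spilled, and that one k-way merge then produces the final sorted output.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.PriorityQueue;

// Hypothetical illustration only; none of these names exist in Hadoop.
public class MemToMemSortSketch {

    /** Sort each buffered segment in memory instead of spilling it to disk. */
    static List<List<String>> sortInMemory(List<List<String>> segments) {
        List<List<String>> sorted = new ArrayList<>();
        for (List<String> segment : segments) {
            List<String> copy = new ArrayList<>(segment);
            Collections.sort(copy);
            sorted.add(copy);
        }
        return sorted;
    }

    /**
     * K-way merge of the sorted in-memory segments. In a real map task the
     * merged stream would be written to disk; here it is returned as a list.
     */
    static List<String> finalMerge(List<List<String>> sortedSegments) {
        // Heap entries are {segmentIndex, offset}, ordered by the key they point to.
        PriorityQueue<int[]> heap = new PriorityQueue<>(
            (a, b) -> sortedSegments.get(a[0]).get(a[1])
                .compareTo(sortedSegments.get(b[0]).get(b[1])));
        for (int i = 0; i < sortedSegments.size(); i++) {
            if (!sortedSegments.get(i).isEmpty()) {
                heap.add(new int[] {i, 0});
            }
        }
        List<String> merged = new ArrayList<>();
        while (!heap.isEmpty()) {
            int[] head = heap.poll();
            List<String> segment = sortedSegments.get(head[0]);
            merged.add(segment.get(head[1]));
            // Advance within the same segment if it has more records.
            if (head[1] + 1 < segment.size()) {
                heap.add(new int[] {head[0], head[1] + 1});
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        List<List<String>> segments = Arrays.asList(
            Arrays.asList("pear", "apple"),
            Arrays.asList("fig", "zebra", "banana"));
        // prints [apple, banana, fig, pear, zebra]
        System.out.println(finalMerge(sortInMemory(segments)));
    }
}
```

In this sketch nothing touches disk until the final merge, which is where the latency saving over repeated sort-and-spill passes would come from, provided all segments fit in memory.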