You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-dev@hadoop.apache.org by "Amar Kamat (JIRA)" <ji...@apache.org> on 2009/07/28 14:13:14 UTC
[jira] Created: (MAPREDUCE-812) Total number of splits/maps can be
encoded as the first field while serializing splits
Total number of splits/maps can be encoded as the first field while serializing splits
--------------------------------------------------------------------------------------
Key: MAPREDUCE-812
URL: https://issues.apache.org/jira/browse/MAPREDUCE-812
Project: Hadoop Map/Reduce
Issue Type: Improvement
Components: jobtracker
Reporter: Amar Kamat
To find out the total number of maps, the whole split file is deserialized and then the checks are made (num-maps = length of the split array). The issue is that if total number of splits is more then unnecessarily load all the splits and then discard it. Instead we can encode the total number of splits as the first field.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.