You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-dev@hadoop.apache.org by "Amar Kamat (JIRA)" <ji...@apache.org> on 2009/07/28 14:13:14 UTC

[jira] Created: (MAPREDUCE-812) Total number of splits/maps can be encoded as the first field while serializing splits

Total number of splits/maps can be encoded as the first field while serializing splits
--------------------------------------------------------------------------------------

                 Key: MAPREDUCE-812
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-812
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: jobtracker
            Reporter: Amar Kamat


To find out the total number of maps, the whole split file is deserialized and  then the checks are made (num-maps = length of the split array). The issue is that if total number of splits is more then unnecessarily load all the splits and then discard it. Instead we can encode the total number of splits as the first field.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.