You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-dev@hadoop.apache.org by "guowp_aily (JIRA)" <ji...@apache.org> on 2013/09/26 08:33:02 UTC
[jira] [Created] (MAPREDUCE-5540) Speculative task makes the default JobQueueTaskScheduler scheduling becomes unreasonable

guowp_aily created MAPREDUCE-5540:
-------------------------------------

             Summary: Speculative task makes the default JobQueueTaskScheduler scheduling becomes unreasonable
                 Key: MAPREDUCE-5540
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5540
             Project: Hadoop Map/Reduce
          Issue Type: Bug
            Reporter: guowp_aily


Speculative task makes the default JobQueueTaskScheduler scheduling becomes unreasonable

Speculative task resulted in a resource is abundant, using the default scheduler, still prone to (map, reduce) task pend.
The Cluster configuration : 3 tasktracker, 12 reduce slot per node. 

 
In the job queue has only 2 jobs:
job_201309221020_0357's eleven reduce tasks are running, and  job_201309221020_0358 has a reduce in the pending state; 
but my cluster, a total of 36 slot, why does job_201309221020_0358 need to be pending ?
Job_201309221020_0358 has been waiting for 2 minutes, and finally in the job_201309221020_0357 has completed a reduce task after the operation .

Check the operation log and scheduling algorithm source code, found that may be because "Speculative task" lead to scheduling algorithm default becomes less.


The task_201309221020_0357_r_000006 task actual start of two attmept (attempt_201309221020_0357_r_000006_0, attempt_201309221020_0357_r_000006_1), so although the job_201309221020_0357 only eleven reduce tasks, but since the opening Speculative task, causing it to the actual occupation of twelve slot (four slots per node), so the currently running   12 slots. 

According to the default scheduling algorithm, completed the reduce tasks running job_201309221020_0358 reduce task must wait for job_201309221020_0357‘s a reduce task, otherwise it will always be pending.So the default scheduling algorithm is not suitable for open "Speculative task" ？

 

JobQueueTaskScheduler  : 
 
double reduceLoadFactor = (double)remainingReduceLoad / clusterReduceCapacity;
//remainingReduceLoad   job queue：job_201309221020_0357's running Reduce + job_201309221020_0358's pending Reduce = 12 
//clusterReduceCapacity  : 36
//reduceLoadFactor=12/36=0.3333333333333333
 
final int trackerCurrentReduceCapacity = 
    Math.min((int)Math.ceil(reduceLoadFactor * trackerReduceCapacity),   trackerReduceCapacity);
//trackerReduceCapacity  running slot:  job_201309221020_0357 ---   12 slots 
//trackerCurrentReduceCapacity=ceil(0.3333333333333333*12)=4
    
    
final int availableReduceSlots = 
      Math.min((trackerCurrentReduceCapacity - trackerRunningReduces), 1);
//trackerRunningReduces   : 4 slots per node
//availableReduceSlots=Math.min((4 - 4), 1)=0 
 
boolean exceededReducePadding = false;
if(availableReduceSlots > 0) {   // if job_201309221020_0357's reduce tasks is running ,the availableReduceSlots is always less 1
	exceededReducePadding = exceededPadding(false, clusterStatus, trackerReduceCapacity);        
	synchronized (jobQueue) {
		LOG.debug("try to assign 1 reduce task to TaskTracker["+taskTracker.trackerName+"]..");
		for (JobInProgress job : jobQueue) {
			if (job.getStatus().getRunState() != JobStatus.RUNNING || job.numReduceTasks == 0) {
			continue;
	}
... ...



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira