You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Owen O'Malley (JIRA)" <ji...@apache.org> on 2006/05/02 06:01:47 UTC

[jira] Updated: (HADOOP-185) tasks are lost during pollForNewTask

     [ http://issues.apache.org/jira/browse/HADOOP-185?page=all ]

Owen O'Malley updated HADOOP-185:
---------------------------------

    Attachment: launch-time-out.patch

This patch adds a new thread to the JobTracker that every three minutes checks the list of launched tasks that haven't been reported yet. If any of them are older than 10 minutes, they are marked as failed.

> tasks are lost during pollForNewTask
> ------------------------------------
>
>          Key: HADOOP-185
>          URL: http://issues.apache.org/jira/browse/HADOOP-185
>      Project: Hadoop
>         Type: Bug

>   Components: mapred
>     Versions: 0.1.1
>     Reporter: Owen O'Malley
>     Assignee: Owen O'Malley
>     Priority: Critical
>      Fix For: 0.2
>  Attachments: launch-time-out.patch
>
> There is the potential for "losing" tasks that are assigned by the JobTracker to a TaskTracker, but that fail during returning the result (usually due to a RPC timeout). In this case, the Job becomes "wedged" in that the tasks will never run and never time out.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira