Posted to mapreduce-issues@hadoop.apache.org by "Robert Joseph Evans (JIRA)" <ji...@apache.org> on 2013/03/12 20:45:13 UTC

[jira] [Created] (MAPREDUCE-5060) Fetch failures that time out only count against the first map task

Robert Joseph Evans created MAPREDUCE-5060:
----------------------------------------------

             Summary: Fetch failures that time out only count against the first map task
                 Key: MAPREDUCE-5060
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5060
             Project: Hadoop Map/Reduce
          Issue Type: Bug
            Reporter: Robert Joseph Evans
            Priority: Critical


When a fetch failure happens after the socket has already "connected", it is counted against only the first map task.  Most of the time, though, the cause is a problem with the node itself rather than with an individual map task, so any failure that occurs while initiating the connection should count against all of the tasks.

This caused a particularly unfortunate job to take an hour and a half longer than it needed to.
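
The accounting difference described above can be illustrated with a small sketch. This is not the actual Hadoop shuffle code: the class FetchFailureSketch, its methods, and the map IDs are all hypothetical, and it assumes that "all of the tasks" means every map output the reducer was trying to fetch from the failing node in that attempt.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; names do not come from the Hadoop code base.
public class FetchFailureSketch {

    // Reported behavior: a failure after the socket "connected" is charged
    // only to the first map output requested from the host.
    public static Map<String, Integer> chargeFirstOnly(List<String> mapsOnHost) {
        Map<String, Integer> failures = new LinkedHashMap<>();
        if (!mapsOnHost.isEmpty()) {
            failures.merge(mapsOnHost.get(0), 1, Integer::sum);
        }
        return failures;
    }

    // Suggested behavior: the same failure is charged to every map output
    // requested from the host, since the node itself is the likely cause.
    public static Map<String, Integer> chargeAll(List<String> mapsOnHost) {
        Map<String, Integer> failures = new LinkedHashMap<>();
        for (String mapId : mapsOnHost) {
            failures.merge(mapId, 1, Integer::sum);
        }
        return failures;
    }

    public static void main(String[] args) {
        // Assumed example: three map outputs requested from one bad node.
        List<String> maps = Arrays.asList("map_0", "map_1", "map_2");
        System.out.println("first only: " + chargeFirstOnly(maps));
        System.out.println("all maps:   " + chargeAll(maps));
    }
}
```

Under the first scheme, each timeout moves only one map task closer to being declared failed, so a reducer may have to sit through a separate timeout for every map output on the bad node. Under the second, a single timeout counts against all of them at once.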

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira