You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Kuhu Shukla (JIRA)" <ji...@apache.org> on 2016/01/22 15:57:39 UTC

[jira] [Commented] (MAPREDUCE-6603) Add counters for failed task attempts

    [ https://issues.apache.org/jira/browse/MAPREDUCE-6603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15112497#comment-15112497 ] 

Kuhu Shukla commented on MAPREDUCE-6603:
----------------------------------------

At Task level, {{selectBestAttempt()}} decides which is the best attempt, and returns null if all the attempts have failed. The null return is checked during counters, progress and report update to eliminate a failed task/attempts. To address the issue in this JIRA, in case there are no successful attempts we can set a flag and return a counters object with all values set to 0. What this would mean is, if we go to task counters we will see all 0 counters instead of 'Sorry it looks like task_1_2_r_3 has no counters. ' and when you click on individual counters it takes you to the attempt counters which hold actual values. The pros are we open up a way see failed attempts without causing task level counters to be incorrect. What was null before is now an all zero counter. The same effect would then cascade to job level counters being all zeros for a fully failed job. The cons are we depart from the design that failed( and killed) tasks should not be considered and it may interfere with other services reading it, which needs validation after this change, for example JHS, curl queries etc. I would appreciate any comments, suggestions and insights for this change. Thanks a lot!

> Add counters for failed task attempts
> -------------------------------------
>
>                 Key: MAPREDUCE-6603
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6603
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 2.7.1, 2.6.3
>            Reporter: Kuhu Shukla
>            Assignee: Kuhu Shukla
>            Priority: Minor
>
> The counters for failed task attempts are currently unavailable and would be nice to have for troubleshooting whilst not including them in the aggregate counters at task or job level. One should be able to view them at attempt level.
> {code}
> Sorry it looks like task_1_2_r_3 has no counters. 
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)