You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Yingda Chen (JIRA)" <ji...@apache.org> on 2019/04/26 17:33:00 UTC

[jira] [Commented] (TEZ-4062) Speculative attempt scheduling should be aborted when Task has completed

    [ https://issues.apache.org/jira/browse/TEZ-4062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16827145#comment-16827145 ] 

Yingda Chen commented on TEZ-4062:
----------------------------------

[~jeagles] any progress on the build failure, and anything we can assist?

Thanks,

> Speculative attempt scheduling should be aborted when Task has completed
> ------------------------------------------------------------------------
>
>                 Key: TEZ-4062
>                 URL: https://issues.apache.org/jira/browse/TEZ-4062
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Yingda Chen
>            Assignee: Ying Han
>            Priority: Major
>         Attachments: TEZ-4062.001.patch
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> In RedundantScheduleTransition (inside TaskImpl), we try to find the oldest running attempt and use it as the causual attempt when doing "addAndScheduleAttempt".
>  
> However, the task may have completed at this moment, i.e., the task attempt that was considered running and long-tailed by speculator is now completed. In this case, we may not be able to find any unfinished attempt, which will lead to NPE in following logic (even without NPE, it still makes no sense to proceed with scheduling speculative attempt anyway)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)