You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Siddharth Seth (JIRA)" <ji...@apache.org> on 2014/10/18 01:49:33 UTC

[jira] [Updated] (TEZ-1682) Tez AM hangs at times when there are task failures

     [ https://issues.apache.org/jira/browse/TEZ-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Siddharth Seth updated TEZ-1682:
--------------------------------
    Attachment: TEZ-1682.1.txt

Fairly straight forward patch. 
task.taskAttemptStatus.clear() on a KillRequest seems incorrect - since it's used to keep track of completed events.

Added a test to verify the Task state change.

[~hitesh], [~zjffdu] - please review - keeping in mind that multiple Finished events should not be generated.

> Tez AM hangs at times when there are task failures
> --------------------------------------------------
>
>                 Key: TEZ-1682
>                 URL: https://issues.apache.org/jira/browse/TEZ-1682
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>            Priority: Blocker
>         Attachments: TEZ-1682.1.txt
>
>
> Reported by [~karams]. 
> The Task does not move into it's final state, and effectively does not send the relevant events to the Vertex.
> Happens when there's multiple attempts for the task - caused by Node failure for instance.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)