You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Kuhu Shukla (JIRA)" <ji...@apache.org> on 2017/07/24 22:28:00 UTC

[jira] [Updated] (TEZ-3803) Tasks can get killed due to insufficient progress while waiting for shuffle inputs to complete

     [ https://issues.apache.org/jira/browse/TEZ-3803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kuhu Shukla updated TEZ-3803:
-----------------------------
    Attachment: TEZ-3803.001.patch

First cut of the patch which does a timed wait and notifies progress.

> Tasks can get killed due to insufficient progress while waiting for shuffle inputs to complete
> ----------------------------------------------------------------------------------------------
>
>                 Key: TEZ-3803
>                 URL: https://issues.apache.org/jira/browse/TEZ-3803
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Kuhu Shukla
>            Assignee: Kuhu Shukla
>            Priority: Critical
>         Attachments: TEZ-3803.001.patch
>
>
> In a scenario where a downstream task has no slow start and gets started before all its shuffle inputs are done, the task can timeout as the wait does not notify progress( set the "progress is being made bit") like it does in MapReduce.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)