Posted to yarn-issues@hadoop.apache.org by "Ahmed Hussein (Jira)" <ji...@apache.org> on 2020/12/17 03:36:00 UTC

[jira] [Commented] (YARN-10536) Client in distributedShell swallows interrupt exceptions

    [ https://issues.apache.org/jira/browse/YARN-10536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17250778#comment-17250778 ] 

Ahmed Hussein commented on YARN-10536:
--------------------------------------

The current implementation checks the timeout against {{Client.clientStartTime}}, which holds the timestamp of object creation, as shown in this [line of code|https://github.com/apache/hadoop/blob/df7f1e5199eed917ff40618708e7641238684d24/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/Client.java#L212].
The timeout should instead be measured from the moment the client is started (by calling {{run()}}), as in this [line of code|https://github.com/apache/hadoop/blob/df7f1e5199eed917ff40618708e7641238684d24/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/Client.java#L671]. I do not think there is any point in starting the countdown at object creation.
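
To make the difference concrete, here is a minimal sketch, not the actual {{Client}} code. The class {{TimeoutClientSketch}}, its fields, and the injectable clock are all hypothetical names introduced for illustration; only the idea of measuring from creation versus from {{run()}} comes from the comment above.

```java
import java.util.function.LongSupplier;

/** Hypothetical stand-in for the client; it only models the two reference points. */
class TimeoutClientSketch {
  private final long clientTimeoutMs;
  // Injectable clock (an assumption of this sketch) so the behavior can be checked without sleeping.
  private final LongSupplier clock;
  // Timestamp taken when the object is created (the reference point criticized above).
  private final long createdAtMs;
  // Timestamp taken when run() is called (the proposed reference point); -1 until then.
  private long startedAtMs = -1L;

  TimeoutClientSketch(long clientTimeoutMs, LongSupplier clock) {
    this.clientTimeoutMs = clientTimeoutMs;
    this.clock = clock;
    this.createdAtMs = clock.getAsLong();
  }

  /** Marks the moment the client actually starts working. */
  void run() {
    startedAtMs = clock.getAsLong();
  }

  /** Timeout check relative to object creation. */
  boolean timedOutSinceCreation() {
    return clock.getAsLong() - createdAtMs > clientTimeoutMs;
  }

  /** Timeout check relative to the call to run(). */
  boolean timedOutSinceRun() {
    if (startedAtMs < 0) {
      throw new IllegalStateException("run() has not been called yet");
    }
    return clock.getAsLong() - startedAtMs > clientTimeoutMs;
  }
}
```

If some time passes between constructing the object and calling {{run()}}, the creation-based check can report a timeout before the client has used its full budget, while the run-based check cannot.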

> Client in distributedShell swallows interrupt exceptions
> --------------------------------------------------------
>
>                 Key: YARN-10536
>                 URL: https://issues.apache.org/jira/browse/YARN-10536
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: client, distributed-shell
>            Reporter: Ahmed Hussein
>            Assignee: Ahmed Hussein
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> In {{applications.distributedshell.Client}}, the method {{monitorApplication}} loops until one of the following conditions holds:
> * Application fails: reaches {{YarnApplicationState.KILLED}} or {{YarnApplicationState.FAILED}}
> * Application succeeds: {{FinalApplicationStatus.SUCCEEDED}} or {{YarnApplicationState.FINISHED}}
> * The time spent waiting exceeds {{clientTimeout}} (if it is set in the parameters).
> When the Client thread is interrupted, the loop only logs the exception and keeps waiting:
> {code:java}
>       // Check app status every 1 second.
>       try {
>         Thread.sleep(1000);
>       } catch (InterruptedException e) {
>         LOG.debug("Thread sleep in monitoring loop interrupted");
>       }
> {code}
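
For comparison with the snippet quoted above, one common way to avoid swallowing the interrupt is to restore the thread's interrupt flag and leave the loop. The following is a minimal sketch of that pattern, not necessarily the fix applied for this issue; {{MonitorLoopSketch}}, {{StatusProbe}}, and {{monitor}} are hypothetical names standing in for the real monitoring code.

```java
class MonitorLoopSketch {
  /** Hypothetical probe; stands in for polling the application's status. */
  interface StatusProbe {
    boolean isFinished();
  }

  /**
   * Polls until the probe reports completion or the thread is interrupted.
   * Returns true if the probe reported completion, false if the wait was interrupted.
   */
  static boolean monitor(StatusProbe probe, long pollIntervalMs) {
    while (!probe.isFinished()) {
      try {
        Thread.sleep(pollIntervalMs);
      } catch (InterruptedException e) {
        // Restore the interrupt flag so callers can still observe it, then stop waiting.
        Thread.currentThread().interrupt();
        return false;
      }
    }
    return true;
  }
}
```

With this shape, a caller that interrupts the monitoring thread gets control back promptly and can check {{Thread.currentThread().isInterrupted()}} to decide what to do next.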



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
