You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Till Rohrmann (Jira)" <ji...@apache.org> on 2019/12/17 16:52:00 UTC

[jira] [Commented] (FLINK-15297) Do not throw exception if YARN Application switched to FINISHED immediately after deployed in YarnClusterDescriptor#startAppMaster

    [ https://issues.apache.org/jira/browse/FLINK-15297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998385#comment-16998385 ] 

Till Rohrmann commented on FLINK-15297:
---------------------------------------

This sounds like a bug to me. In particular in detached mode it could happen that the deployed job terminates and does not wait for the client to connect. Hence, I think we should correct this.

> Do not throw exception if YARN Application switched to FINISHED immediately after deployed in YarnClusterDescriptor#startAppMaster
> ----------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-15297
>                 URL: https://issues.apache.org/jira/browse/FLINK-15297
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / YARN
>            Reporter: Zili Chen
>            Assignee: Zili Chen
>            Priority: Major
>
> Currently we throw an exception in {{YarnClusterDescriptor#startAppMaster}} if we first detect {{FINISHED}} before {{RUNNING}}. However, it is possible a legal state that the application finished normally immediately.
> Right now we always try to connect the Dispatcher so it may be fine to throw the exception a bit earlier(otherwise when connect to a closed cluster an exception thrown also), but it is semantically wrong. Internally we have a code path that only required to report the ApplicationReport and it causes trouble.
> cc [~trohrmann] what do you think?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)