You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Ufuk Celebi (JIRA)" <ji...@apache.org> on 2015/11/04 16:48:27 UTC

[jira] [Created] (FLINK-2970) Yarn client cannot connect to new job manager

Ufuk Celebi created FLINK-2970:
----------------------------------

             Summary: Yarn client cannot connect to new job manager
                 Key: FLINK-2970
                 URL: https://issues.apache.org/jira/browse/FLINK-2970
             Project: Flink
          Issue Type: Bug
          Components: Distributed Runtime, YARN Client
    Affects Versions: 0.10
            Reporter: Ufuk Celebi


I'm running a YARN session with 2 physical nodes and 5 containers (ApplicationMaster and 4 TaskManagers). There is no Flink program submitted to the cluster.

Running a sequence of failure operations (killing the ApplicationMaster and TaskManager containers), I sometimes get an infinite loop of

{code}
15:45:29,719 WARN  Remoting                                                      - Tried to associate with unreachable remote address [akka.tcp://flink@10.240.0.3:58926]. Address is now gated for 5000 ms, all messages to this address will be delivered to dead letters. Reason: Connection refused: /10.240.0.3:58926
{code}

I see that the ApplicationMaster container has been started though.

I would not block the RC on this and address it for 0.10.1 or 1.0.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)