Posted to issues@flink.apache.org by "Julio Biason (JIRA)" <ji...@apache.org> on 2018/07/25 17:57:00 UTC

[jira] [Created] (FLINK-9960) JobManager can't failover to hdfs with multiple namenodes

Julio Biason created FLINK-9960:
-----------------------------------

             Summary: JobManager can't failover to hdfs with multiple namenodes
                 Key: FLINK-9960
                 URL: https://issues.apache.org/jira/browse/FLINK-9960
             Project: Flink
          Issue Type: Bug
          Components: JobManager
    Affects Versions: 1.4.0
            Reporter: Julio Biason


When Hadoop/HDFS is running in HA mode, there may be more than one namenode running: one active and one on standby.

If, for whatever reason, the server configured in high-availability.storageDir is currently in standby mode (e.g., Hadoop failed over to the other namenode), the JobManager will refuse to start because it can't contact HDFS. In this case, it should have a list of servers (up to 2) and, if a connection fails, move on to the next one.
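
A rough sketch of the requested behavior, only to illustrate the idea: try each configured storage directory in order and move on to the next one when a connection fails. This is not Flink code. The class StorageDirFailover, the Probe interface, and the host names in the usage note are all hypothetical, and how the probe actually checks a namenode is left to the caller.

```java
import java.io.IOException;
import java.net.URI;
import java.util.List;

// Hypothetical helper, not part of Flink or Hadoop.
class StorageDirFailover {

    /** Hypothetical check: throws IOException if the namenode behind the URI cannot be used. */
    interface Probe {
        void check(URI storageDir) throws IOException;
    }

    /**
     * Returns the first candidate whose probe succeeds.
     * Throws IllegalStateException (wrapping the last failure) if every candidate fails.
     */
    static URI firstReachable(List<URI> candidates, Probe probe) {
        IOException last = null;
        for (URI candidate : candidates) {
            try {
                probe.check(candidate);
                return candidate;
            } catch (IOException e) {
                // For example, this namenode is in standby; try the next one.
                last = e;
            }
        }
        throw new IllegalStateException(
                "No usable storage directory among " + candidates, last);
    }
}
```

With candidates such as hdfs://namenode1:8020/flink/ha and hdfs://namenode2:8020/flink/ha (made-up addresses), a probe that fails for namenode1 would make firstReachable return the namenode2 URI instead of aborting startup.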



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)