You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user-zh@flink.apache.org by 陈韬 <to...@qq.com> on 2019/03/22 08:45:03 UTC
提交job到服务器,报错Failed to retrieve the JobManager gateway.
本机上用官方的flink 1.6的docker,部署的standalone,提交任务一直报错,
Exception in thread "main" org.apache.flink.client.program.ProgramInvocationException: Failed to retrieve the JobManager gateway. (JobID: 18a5c5c849ca12278312218a7391eb46)
at org.apache.flink.client.program.ClusterClient.runDetached(ClusterClient.java:542)
at org.apache.flink.client.program.StandaloneClusterClient.submitJob(StandaloneClusterClient.java:117)
at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:486)
at org.apache.flink.client.program.DetachedEnvironment.finalizeExecute(DetachedEnvironment.java:77)
at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:432)
at com.dtstack.flink.sql.launcher.LauncherMain.main(LauncherMain.java:87)
Caused by: org.apache.flink.util.FlinkException: Could not connect to the leading JobManager. Please check that the JobManager is running.
at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:956)
at org.apache.flink.client.program.ClusterClient.runDetached(ClusterClient.java:539)
... 5 more
Caused by: org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: Could not retrieve the leader gateway.
at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:83)
at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:951)
... 6 more
Caused by: java.util.concurrent.TimeoutException: Futures timed out after [10000 milliseconds]
at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:223)
at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:227)
at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:190)
at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)
at scala.concurrent.Await$.result(package.scala:190)
at scala.concurrent.Await.result(package.scala)
at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:81)
... 7 more
Re: 提交job到服务器,报错Failed to retrieve the JobManager gateway.
Posted by 陈韬 <to...@qq.com>.
首先要谢谢您。
如果您说的配置文件指的是flink-conf.yaml的话,我确定没有放开任何关于checkpoint的配置
> 在 2019年3月22日,下午4:56,xiaoh20@chinaunicom.cn 写道:
>
> 检查您的配置文件,可能是使用了checkpoint,但checkpoint路径设置有问题,从而找不到 JobID
> 我也是初学者,希望能帮到您
>
> Good Luck!
> 发件人: 陈韬
> 发送时间: 2019-03-22 16:45
> 收件人: user-zh@flink.apache.org
> 主题: 提交job到服务器,报错Failed to retrieve the JobManager gateway.
> 本机上用官方的flink 1.6的docker,部署的standalone,提交任务一直报错,
> Exception in thread "main" org.apache.flink.client.program.ProgramInvocationException: Failed to retrieve the JobManager gateway. (JobID: 18a5c5c849ca12278312218a7391eb46)
> at org.apache.flink.client.program.ClusterClient.runDetached(ClusterClient.java:542)
> at org.apache.flink.client.program.StandaloneClusterClient.submitJob(StandaloneClusterClient.java:117)
> at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:486)
> at org.apache.flink.client.program.DetachedEnvironment.finalizeExecute(DetachedEnvironment.java:77)
> at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:432)
> at com.dtstack.flink.sql.launcher.LauncherMain.main(LauncherMain.java:87)
> Caused by: org.apache.flink.util.FlinkException: Could not connect to the leading JobManager. Please check that the JobManager is running.
> at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:956)
> at org.apache.flink.client.program.ClusterClient.runDetached(ClusterClient.java:539)
> ... 5 more
> Caused by: org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: Could not retrieve the leader gateway.
> at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:83)
> at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:951)
> ... 6 more
> Caused by: java.util.concurrent.TimeoutException: Futures timed out after [10000 milliseconds]
> at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:223)
> at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:227)
> at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:190)
> at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)
> at scala.concurrent.Await$.result(package.scala:190)
> at scala.concurrent.Await.result(package.scala)
> at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:81)
> ... 7 more
>
>
回复: 提交job到服务器,报错Failed to retrieve the JobManager gateway.
Posted by "xiaoh20@chinaunicom.cn" <xi...@chinaunicom.cn>.
检查您的配置文件,可能是使用了checkpoint,但checkpoint路径设置有问题,从而找不到 JobID
我也是初学者,希望能帮到您
Good Luck!
发件人: 陈韬
发送时间: 2019-03-22 16:45
收件人: user-zh@flink.apache.org
主题: 提交job到服务器,报错Failed to retrieve the JobManager gateway.
本机上用官方的flink 1.6的docker,部署的standalone,提交任务一直报错,
Exception in thread "main" org.apache.flink.client.program.ProgramInvocationException: Failed to retrieve the JobManager gateway. (JobID: 18a5c5c849ca12278312218a7391eb46)
at org.apache.flink.client.program.ClusterClient.runDetached(ClusterClient.java:542)
at org.apache.flink.client.program.StandaloneClusterClient.submitJob(StandaloneClusterClient.java:117)
at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:486)
at org.apache.flink.client.program.DetachedEnvironment.finalizeExecute(DetachedEnvironment.java:77)
at org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:432)
at com.dtstack.flink.sql.launcher.LauncherMain.main(LauncherMain.java:87)
Caused by: org.apache.flink.util.FlinkException: Could not connect to the leading JobManager. Please check that the JobManager is running.
at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:956)
at org.apache.flink.client.program.ClusterClient.runDetached(ClusterClient.java:539)
... 5 more
Caused by: org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: Could not retrieve the leader gateway.
at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:83)
at org.apache.flink.client.program.ClusterClient.getJobManagerGateway(ClusterClient.java:951)
... 6 more
Caused by: java.util.concurrent.TimeoutException: Futures timed out after [10000 milliseconds]
at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:223)
at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:227)
at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:190)
at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)
at scala.concurrent.Await$.result(package.scala:190)
at scala.concurrent.Await.result(package.scala)
at org.apache.flink.runtime.util.LeaderRetrievalUtils.retrieveLeaderGateway(LeaderRetrievalUtils.java:81)
... 7 more