You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user-zh@flink.apache.org by 仙剑……情动人间 <15...@qq.com.INVALID> on 2021/11/13 04:27:14 UTC
回复: flink 触发保存点失败
非常感谢, 确实是这个问题,我把基于 zk 的ha配置 使用 —D 参数指定之后就成功触发了savepoint
------------------ 原始邮件 ------------------
发件人: "user-zh" <lycbug666@gmail.com>;
发送时间: 2021年7月28日(星期三) 中午11:16
收件人: "user-zh"<user-zh@flink.apache.org>;
主题: Re: flink 触发保存点失败
Hi,
之前遇到过这个 jobid 为 00000 的报错情况。我们的场景是是任务开启了基于 zk 的 ha,但是使用未配置 ha 的 flink
client 去运行 savepoint 命令。
可以考虑下是否是相同的问题。
Michael Ran <greemqqran@163.com> 于2021年7月23日周五 上午10:43写道:
> 有没可能是文件的问题,比如写入权限之类的?
> 在 2021-07-13 17:31:19,"仙剑……情动人间" <1510603449@qq.com.INVALID> 写道:
> >Hi All,
> >
> >
> >&nbsp; &nbsp; 我触发Flink
> 保存点总是失败,报错如下,一直说是超时,但是没有进一步的信息可以查看,我查资料说可以设置checkpoint超时时间,我设置了2min,但是触发
> >保存点时在2min之前就会报错,另外我的 状态 并不大
> >&nbsp; &nbsp;
> >
> >
> >------------------------------------------------------------
> >&nbsp;The program finished with the following exception:
> >
> >
> >org.apache.flink.util.FlinkException: Triggering a savepoint for the job
> 00000000000000000000000000000000 failed.
> > at
> org.apache.flink.client.cli.CliFrontend.triggerSavepoint(CliFrontend.java:777)
> > at
> org.apache.flink.client.cli.CliFrontend.lambda$savepoint$9(CliFrontend.java:754)
> > at
> org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
> > at
> org.apache.flink.client.cli.CliFrontend.savepoint(CliFrontend.java:751)
> > at
> org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1072)
> > at
> org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
> > at java.security.AccessController.doPrivileged(Native Method)
> > at javax.security.auth.Subject.doAs(Subject.java:422)
> > at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
> > at
> org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
> > at
> org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
> >Caused by: java.util.concurrent.TimeoutException
> > at
> org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1255)
> > at
> org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:217)
> > at
> org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:582)
> > at
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> > at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> > at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
> > at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
> > at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> > at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> > at java.lang.Thread.run(Thread.java:748)
>