You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user-zh@flink.apache.org by sky <su...@qq.com.INVALID> on 2021/11/12 12:59:59 UTC

请求帮助

我使用的事flink on yarn。在执行命令时: &nbsp; flink run -m yarn-cluster ./examples/batch/WordCount.jar&nbsp;&nbsp;  结果却报错了:
------------------------------------------------------------
&nbsp;The program finished with the following exception:

org.apache.flink.client.program.ProgramInvocationException: The main method caused an error: org.apache.flink.runtime.rest.util.RestClientException: [org.apache.flink.runtime.rest.handler.RestHandlerException: org.apache.flink.runtime.messages.FlinkJobNotFoundException: Could not find Flink job (397a081a0313f462818575fc725b3582)
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at org.apache.flink.runtime.rest.handler.job.JobExecutionResultHandler.propagateException(JobExecutionResultHandler.java:94)
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at org.apache.flink.runtime.rest.handler.job.JobExecutionResultHandler.lambda$handleRequest$1(JobExecutionResultHandler.java:84)
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at java.util.concurrent.CompletableFuture.uniExceptionally(CompletableFuture.java:870)

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ...
麻烦告知是什么原因呢,我配置文件是这样的:
#===============================================================================
high-availability: zookeeper
high-availability.storageDir: hdfs://mycluster/flink/ha/
high-availability.zookeeper.quorum: hadoop201:2181,hadoop202:2181,hadoop203:2181
high-availability.zookeeper.path.root: /flink
high-availability.cluster-id: /default_one # important: customize per cluster
#设置ck的状态后端
state.backend: filesystem
state.checkpoints.dir: hdfs://mycluster/flink/checkpoints
#设置默认的savepoint的保存位置
state.savepoints.dir: hdfs://mycluster/flink/savepoints
# 集群名称不能写错
jobmanager.archive.fs.dir: hdfs://mycluster/flink/completed-jobs/
historyserver.archive.fs.dir: hdfs://mycluster/flink/completed-jobs/
#===============================================================================

谢谢!

Re: 请求帮助

Posted by Yang Wang <da...@gmail.com>.
你可以查看一下JM的日志,看看提交的job或者failover之后恢复的job
id是不是397a081a0313f462818575fc725b3582


Best,
Yang

RS <ti...@163.com> 于2021年11月15日周一 上午9:53写道:

> 查看下client的日志,一般在flink的logs目录下
>
>
>
>
> 在 2021-11-12 20:59:59,"sky" <su...@qq.com.INVALID> 写道:
> >我使用的事flink on yarn。在执行命令时: &nbsp; flink run -m yarn-cluster
> ./examples/batch/WordCount.jar&nbsp;&nbsp;  结果却报错了:
> >------------------------------------------------------------
> >&nbsp;The program finished with the following exception:
> >
> >org.apache.flink.client.program.ProgramInvocationException: The main
> method caused an error:
> org.apache.flink.runtime.rest.util.RestClientException:
> [org.apache.flink.runtime.rest.handler.RestHandlerException:
> org.apache.flink.runtime.messages.FlinkJobNotFoundException: Could not find
> Flink job (397a081a0313f462818575fc725b3582)
> >&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at
> org.apache.flink.runtime.rest.handler.job.JobExecutionResultHandler.propagateException(JobExecutionResultHandler.java:94)
> >&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at
> org.apache.flink.runtime.rest.handler.job.JobExecutionResultHandler.lambda$handleRequest$1(JobExecutionResultHandler.java:84)
> >&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at
> java.util.concurrent.CompletableFuture.uniExceptionally(CompletableFuture.java:870)
> >
> >&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ...
> >麻烦告知是什么原因呢,我配置文件是这样的:
>
> >#===============================================================================
> >high-availability: zookeeper
> >high-availability.storageDir: hdfs://mycluster/flink/ha/
> >high-availability.zookeeper.quorum:
> hadoop201:2181,hadoop202:2181,hadoop203:2181
> >high-availability.zookeeper.path.root: /flink
> >high-availability.cluster-id: /default_one # important: customize per
> cluster
> >#设置ck的状态后端
> >state.backend: filesystem
> >state.checkpoints.dir: hdfs://mycluster/flink/checkpoints
> >#设置默认的savepoint的保存位置
> >state.savepoints.dir: hdfs://mycluster/flink/savepoints
> ># 集群名称不能写错
> >jobmanager.archive.fs.dir: hdfs://mycluster/flink/completed-jobs/
> >historyserver.archive.fs.dir: hdfs://mycluster/flink/completed-jobs/
>
> >#===============================================================================
> >
> >谢谢!
>

Re:请求帮助

Posted by RS <ti...@163.com>.
查看下client的日志,一般在flink的logs目录下




在 2021-11-12 20:59:59,"sky" <su...@qq.com.INVALID> 写道:
>我使用的事flink on yarn。在执行命令时: &nbsp; flink run -m yarn-cluster ./examples/batch/WordCount.jar&nbsp;&nbsp;  结果却报错了:
>------------------------------------------------------------
>&nbsp;The program finished with the following exception:
>
>org.apache.flink.client.program.ProgramInvocationException: The main method caused an error: org.apache.flink.runtime.rest.util.RestClientException: [org.apache.flink.runtime.rest.handler.RestHandlerException: org.apache.flink.runtime.messages.FlinkJobNotFoundException: Could not find Flink job (397a081a0313f462818575fc725b3582)
>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at org.apache.flink.runtime.rest.handler.job.JobExecutionResultHandler.propagateException(JobExecutionResultHandler.java:94)
>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at org.apache.flink.runtime.rest.handler.job.JobExecutionResultHandler.lambda$handleRequest$1(JobExecutionResultHandler.java:84)
>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; at java.util.concurrent.CompletableFuture.uniExceptionally(CompletableFuture.java:870)
>
>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ...
>麻烦告知是什么原因呢,我配置文件是这样的:
>#===============================================================================
>high-availability: zookeeper
>high-availability.storageDir: hdfs://mycluster/flink/ha/
>high-availability.zookeeper.quorum: hadoop201:2181,hadoop202:2181,hadoop203:2181
>high-availability.zookeeper.path.root: /flink
>high-availability.cluster-id: /default_one # important: customize per cluster
>#设置ck的状态后端
>state.backend: filesystem
>state.checkpoints.dir: hdfs://mycluster/flink/checkpoints
>#设置默认的savepoint的保存位置
>state.savepoints.dir: hdfs://mycluster/flink/savepoints
># 集群名称不能写错
>jobmanager.archive.fs.dir: hdfs://mycluster/flink/completed-jobs/
>historyserver.archive.fs.dir: hdfs://mycluster/flink/completed-jobs/
>#===============================================================================
>
>谢谢!