You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Robert Metzger (Jira)" <ji...@apache.org> on 2021/01/18 13:00:00 UTC
[jira] [Closed] (FLINK-19067) resource_manager and dispatcher
register on different nodes in HA mode will cause FileNotFoundException
[ https://issues.apache.org/jira/browse/FLINK-19067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Metzger closed FLINK-19067.
----------------------------------
Resolution: Not A Problem
> resource_manager and dispatcher register on different nodes in HA mode will cause FileNotFoundException
> -------------------------------------------------------------------------------------------------------
>
> Key: FLINK-19067
> URL: https://issues.apache.org/jira/browse/FLINK-19067
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination
> Affects Versions: 1.11.1
> Reporter: JieFang.He
> Assignee: Robert Metzger
> Priority: Major
> Attachments: flink-jobmanager-deployer-hejiefang01.log, flink-jobmanager-deployer-hejiefang02.log, flink-taskmanager-deployer-hejiefang01.log, flink-taskmanager-deployer-hejiefang02.log
>
>
> When run examples/batch/WordCount.jar,it will fail with the exception:
> {code:java}
> Caused by: java.io.FileNotFoundException: /data2/flink/storageDir/default/blob/job_d29414828f614d5466e239be4d3889ac/blob_p-a2ebe1c5aa160595f214b4bd0f39d80e42ee2e93-f458f1c12dc023e78d25f191de1d7c4b (No such file or directory)
> at java.io.FileInputStream.open0(Native Method)
> at java.io.FileInputStream.open(FileInputStream.java:195)
> at java.io.FileInputStream.<init>(FileInputStream.java:138)
> at org.apache.flink.core.fs.local.LocalDataInputStream.<init>(LocalDataInputStream.java:50)
> at org.apache.flink.core.fs.local.LocalFileSystem.open(LocalFileSystem.java:143)
> at org.apache.flink.runtime.blob.FileSystemBlobStore.get(FileSystemBlobStore.java:105)
> at org.apache.flink.runtime.blob.FileSystemBlobStore.get(FileSystemBlobStore.java:87)
> at org.apache.flink.runtime.blob.BlobServer.getFileInternal(BlobServer.java:501)
> at org.apache.flink.runtime.blob.BlobServerConnection.get(BlobServerConnection.java:231)
> at org.apache.flink.runtime.blob.BlobServerConnection.run(BlobServerConnection.java:117)
> {code}
>
> I think the reason is that the jobFiles are upload to the dispatcher node,but the task get jobFiles from resource_manager node. So in HA mode, it need to ensure they are on one node
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)