You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Seth Wiesman (Jira)" <ji...@apache.org> on 2021/09/15 20:35:00 UTC

[jira] [Created] (FLINK-24303) SourceCoordinator exception may fail Session Cluster

Seth Wiesman created FLINK-24303:
------------------------------------

             Summary: SourceCoordinator exception may fail Session Cluster
                 Key: FLINK-24303
                 URL: https://issues.apache.org/jira/browse/FLINK-24303
             Project: Flink
          Issue Type: Bug
            Reporter: Seth Wiesman


The SourceCoordinator currently forwards all exceptions from `Source#createEnumerator` up the stack triggering a JobMaster failover. However, JobMaster failover only works if HA is enabled[1]. If not, the fatal error handler will simply exit the JM process killing the entire cluster. 

 

It would be preferable if failure to create an enumerator did not take down the entire cluster, but instead failed that particular job. 

 

[1] https://github.com/apache/flink/blob/7f69331294ab2ab73f77b40a4320cdda53246afe/flink-runtime/src/main/java/org/apache/flink/runtime/dispatcher/Dispatcher.java#L898-L903



--
This message was sent by Atlassian Jira
(v8.3.4#803005)