You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Aitozi (Jira)" <ji...@apache.org> on 2022/09/15 05:22:00 UTC
[jira] [Created] (FLINK-29308) NoResourceAvailableException fails the batch job
Aitozi created FLINK-29308:
------------------------------
Summary: NoResourceAvailableException fails the batch job
Key: FLINK-29308
URL: https://issues.apache.org/jira/browse/FLINK-29308
Project: Flink
Issue Type: Improvement
Components: Runtime / Coordination
Reporter: Aitozi
When running batch job configured with the following restart strategy
{code:java}
restart-strategy: fixed-delay
restart-strategy.fixed-delay.delay: 15 s
restart-strategy.fixed-delay.attempts: 10 {code}
If the cluster resource is not enough to run the single stage, it can run partial of the stage, but it still will fail after the 10 times \{{NoResourceAvailableException}}. IMO, for batch job the \{{NoResourceAvailableException}} do not necessary to trigger the job to fail. Or at least this failure reason are not share the same restart strategy with other failure reasons
--
This message was sent by Atlassian Jira
(v8.20.10#820010)