You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@flink.apache.org by "Aitozi (Jira)" <ji...@apache.org> on 2022/09/15 05:22:00 UTC

[jira] [Created] (FLINK-29308) NoResourceAvailableException fails the batch job

Aitozi created FLINK-29308:
------------------------------

             Summary: NoResourceAvailableException fails the batch job
                 Key: FLINK-29308
                 URL: https://issues.apache.org/jira/browse/FLINK-29308
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / Coordination
            Reporter: Aitozi


When running batch job configured with the following restart strategy
{code:java}
restart-strategy: fixed-delay
restart-strategy.fixed-delay.delay: 15 s
restart-strategy.fixed-delay.attempts: 10 {code}
If the cluster resource is not enough to run the single stage, it can run partial of the stage, but it still will fail after the 10 times \{{NoResourceAvailableException}}. IMO, for batch job the \{{NoResourceAvailableException}} do not necessary to trigger the job to fail. Or at least this failure reason are not share the same restart strategy with other failure reasons



--
This message was sent by Atlassian Jira
(v8.20.10#820010)