Posted to issues@flink.apache.org by "Liu (Jira)" <ji...@apache.org> on 2022/01/07 11:17:00 UTC

[jira] [Created] (FLINK-25566) Fail to cancel task if disk is bad for java.lang.NoClassDefFoundError

Liu created FLINK-25566:
---------------------------

             Summary: Fail to cancel task if disk is bad for java.lang.NoClassDefFoundError
                 Key: FLINK-25566
                 URL: https://issues.apache.org/jira/browse/FLINK-25566
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / Task
            Reporter: Liu
         Attachments: image-2022-01-07-19-07-10-968.png, image-2022-01-07-19-08-49-038.png, image-2022-01-07-19-11-39-448.png

When we detect a disk error, we restart the job to rescale. However, the affected task gets stuck in cancelling because of a java.lang.NoClassDefFoundError.

!image-2022-01-07-19-08-49-038.png|width=743,height=157!

In TaskManagerRunner's onFatalError method, terminateJVM is not called immediately, so the process hangs on the bad disk.

!image-2022-01-07-19-11-39-448.png|width=1085,height=400!

In this case, maybe we should terminate the container immediately.
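
As a rough illustration of the idea, here is a minimal sketch of a bounded shutdown: wait a limited time for the graceful path, then force the exit. This is not Flink's code. The class FatalErrorWatchdog, the method shutdownOrHalt, and its parameters are hypothetical names made up for this example; only the JDK calls are real.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not part of Flink.
public class FatalErrorWatchdog {

    /**
     * Waits up to timeoutMillis for gracefulShutdown to complete.
     * If it times out or fails, runs forceExit instead.
     *
     * @return true if the forced exit path was taken, false otherwise
     */
    public static boolean shutdownOrHalt(
            CompletableFuture<Void> gracefulShutdown,
            long timeoutMillis,
            Runnable forceExit) {
        try {
            gracefulShutdown.get(timeoutMillis, TimeUnit.MILLISECONDS);
            return false; // graceful shutdown finished in time
        } catch (InterruptedException e) {
            // Restore the interrupt flag, then fall through to the forced exit.
            Thread.currentThread().interrupt();
        } catch (Exception | LinkageError e) {
            // Timeout, failed shutdown, or a class loading failure such as
            // NoClassDefFoundError (a LinkageError): fall through.
        }
        forceExit.run();
        return true;
    }
}
```

A caller could pass something like () -> Runtime.getRuntime().halt(1) as forceExit (the exit code is arbitrary here). Runtime.halt does not run shutdown hooks, whereas System.exit does, and a shutdown hook that touches the bad disk could hang in the same way.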



--
This message was sent by Atlassian Jira
(v8.20.1#820001)