You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Liu (Jira)" <ji...@apache.org> on 2022/01/11 10:09:00 UTC

[jira] [Updated] (FLINK-25566) Fail to cancel task if disk is bad for java.lang.NoClassDefFoundError

     [ https://issues.apache.org/jira/browse/FLINK-25566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Liu updated FLINK-25566:
------------------------
    Description: 
When disk error, the related task will stuck for java.lang.NoClassDefFoundError.

!image-2022-01-07-19-08-49-038.png|width=743,height=157!

In the TaskManagerRunner's method onFatalError, it will not terminateJVM at once. The process will stuck in the disk.

!image-2022-01-07-19-11-39-448.png|width=1085,height=400!

In this case, maybe we should terminate the container at once.

  was:
When we detecting disk error, we will restart the job to rescale. However, the related task will stuck in cancelling for java.lang.NoClassDefFoundError.

!image-2022-01-07-19-08-49-038.png|width=743,height=157!

In the TaskManagerRunner's method onFatalError, it will not terminateJVM at once. The process will stuck in the disk.

!image-2022-01-07-19-11-39-448.png|width=1085,height=400!

In this case, maybe we should terminate the container at once.


> Fail to cancel task if disk is bad for java.lang.NoClassDefFoundError
> ---------------------------------------------------------------------
>
>                 Key: FLINK-25566
>                 URL: https://issues.apache.org/jira/browse/FLINK-25566
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Task
>            Reporter: Liu
>            Priority: Major
>         Attachments: image-2022-01-07-19-07-10-968.png, image-2022-01-07-19-08-49-038.png, image-2022-01-07-19-11-39-448.png
>
>
> When disk error, the related task will stuck for java.lang.NoClassDefFoundError.
> !image-2022-01-07-19-08-49-038.png|width=743,height=157!
> In the TaskManagerRunner's method onFatalError, it will not terminateJVM at once. The process will stuck in the disk.
> !image-2022-01-07-19-11-39-448.png|width=1085,height=400!
> In this case, maybe we should terminate the container at once.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)