You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Liu (Jira)" <ji...@apache.org> on 2022/01/11 10:09:00 UTC
[jira] [Updated] (FLINK-25566) Fail to cancel task if disk is bad for java.lang.NoClassDefFoundError
[ https://issues.apache.org/jira/browse/FLINK-25566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Liu updated FLINK-25566:
------------------------
Description:
When disk error, the related task will stuck for java.lang.NoClassDefFoundError.
!image-2022-01-07-19-08-49-038.png|width=743,height=157!
In the TaskManagerRunner's method onFatalError, it will not terminateJVM at once. The process will stuck in the disk.
!image-2022-01-07-19-11-39-448.png|width=1085,height=400!
In this case, maybe we should terminate the container at once.
was:
When we detecting disk error, we will restart the job to rescale. However, the related task will stuck in cancelling for java.lang.NoClassDefFoundError.
!image-2022-01-07-19-08-49-038.png|width=743,height=157!
In the TaskManagerRunner's method onFatalError, it will not terminateJVM at once. The process will stuck in the disk.
!image-2022-01-07-19-11-39-448.png|width=1085,height=400!
In this case, maybe we should terminate the container at once.
> Fail to cancel task if disk is bad for java.lang.NoClassDefFoundError
> ---------------------------------------------------------------------
>
> Key: FLINK-25566
> URL: https://issues.apache.org/jira/browse/FLINK-25566
> Project: Flink
> Issue Type: Improvement
> Components: Runtime / Task
> Reporter: Liu
> Priority: Major
> Attachments: image-2022-01-07-19-07-10-968.png, image-2022-01-07-19-08-49-038.png, image-2022-01-07-19-11-39-448.png
>
>
> When disk error, the related task will stuck for java.lang.NoClassDefFoundError.
> !image-2022-01-07-19-08-49-038.png|width=743,height=157!
> In the TaskManagerRunner's method onFatalError, it will not terminateJVM at once. The process will stuck in the disk.
> !image-2022-01-07-19-11-39-448.png|width=1085,height=400!
> In this case, maybe we should terminate the container at once.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)