Posted to issues@flink.apache.org by "Biao Liu (JIRA)" <ji...@apache.org> on 2018/10/10 06:55:00 UTC

[jira] [Commented] (FLINK-9845) Make InternalTimerService's timer processing interruptible/abortable

    [ https://issues.apache.org/jira/browse/FLINK-9845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16644530#comment-16644530 ] 

Biao Liu commented on FLINK-9845:
---------------------------------

[~till.rohrmann] This proposal sounds great. We have run into this problem ourselves. If nobody is working on it, I'd like to take it.

> Make InternalTimerService's timer processing interruptible/abortable
> --------------------------------------------------------------------
>
>                 Key: FLINK-9845
>                 URL: https://issues.apache.org/jira/browse/FLINK-9845
>             Project: Flink
>          Issue Type: Improvement
>          Components: State Backends, Checkpointing
>    Affects Versions: 1.5.1, 1.6.0
>            Reporter: Till Rohrmann
>            Priority: Major
>             Fix For: 1.7.0
>
>
> When cancelling a {{Task}}, the task thread might currently be processing the timers registered at the {{InternalTimerService}}. Depending on the timer action, this might take a while and thus block the cancellation of the {{Task}}. In the most extreme case, the {{TaskCancelerWatchDog}} kicks in and kills the whole {{TaskManager}} process.
> In order to alleviate the problem (speed up the cancellation reaction), we should make the processing of the timers interruptible/abortable. This means that instead of processing all timers we should check in between timers whether the {{Task}} is currently being cancelled or not. If this is the case, then we should directly stop processing the remaining timers and return.
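
For illustration, the "check in between timers" idea from the description could look roughly like the sketch below. This is only a minimal, self-contained sketch and not Flink code: the class InterruptibleTimerSketch, its Timer type, and the methods register, cancel, advanceTo and pending are all hypothetical names, and the real InternalTimerService may be structured quite differently.

```java
import java.util.Comparator;
import java.util.PriorityQueue;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical stand-in for a timer service; NOT Flink's InternalTimerService.
class InterruptibleTimerSketch {

    // Hypothetical timer: a timestamp plus the action to run when it fires.
    static final class Timer {
        final long timestamp;
        final Runnable action;

        Timer(long timestamp, Runnable action) {
            this.timestamp = timestamp;
            this.action = action;
        }
    }

    // Timers ordered by timestamp, earliest first.
    private final PriorityQueue<Timer> timers =
            new PriorityQueue<Timer>(Comparator.comparingLong((Timer t) -> t.timestamp));

    // Set by the cancelling thread, read by the task thread.
    private final AtomicBoolean cancelled = new AtomicBoolean(false);

    void register(long timestamp, Runnable action) {
        timers.add(new Timer(timestamp, action));
    }

    void cancel() {
        cancelled.set(true);
    }

    int pending() {
        return timers.size();
    }

    // Fires all timers due at or before 'time', but re-checks the cancel flag
    // before each timer and returns early once cancellation is requested.
    // Returns the number of timers that were actually fired.
    int advanceTo(long time) {
        int fired = 0;
        while (!cancelled.get()) {
            Timer next = timers.peek();
            if (next == null || next.timestamp > time) {
                break; // nothing (more) is due
            }
            timers.poll();
            next.action.run();
            fired++;
        }
        return fired;
    }
}
```

Note that a check like this only bounds the delay to the duration of a single timer action; an action that itself blocks for a long time would still hold up cancellation until it returns.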



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)