You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@flink.apache.org by Guojun Li <gj...@gmail.com> on 2022/11/03 03:30:40 UTC

How to get checkpoint stats after job has terminated

Hi, Flink User Group

I want to retrieve the last few completed checkpoints' stats even if the job has terminated, these stats are useful for restarting the job manually. Because we prefer to restore job from retained checkpoint rather than savepoint. Doc Monitoring Checkpointing <https://nightlies.apache.org/flink/flink-docs-master/docs/ops/monitoring/checkpoint_monitoring/> said these stats available after the job has terminated.

So I’m wondering
1. How to get checkpoint stats after job has terminated? 
2. How long these stats will retain?

Thanks,
Guojun



Re: How to get checkpoint stats after job has terminated

Posted by yidan zhao <hi...@gmail.com>.
First of all, you should trigger a savepoint before stopping the job,
and then you can restart the job with the savepoint.

For checkpoints, you need to set
‘execution.checkpointing.externalized-checkpoint-retention’ to
'RETAIN_ON_CANCELLATION'. You can get the checkpoints info via history
server.

Guojun Li <gj...@gmail.com> 于2022年11月3日周四 11:31写道:
>
> Hi, Flink User Group
>
> I want to retrieve the last few completed checkpoints' stats even if the job has terminated, these stats are useful for restarting the job manually. Because we prefer to restore job from retained checkpoint rather than savepoint. Doc Monitoring Checkpointing said these stats available after the job has terminated.
>
> So I’m wondering
> 1. How to get checkpoint stats after job has terminated?
> 2. How long these stats will retain?
>
> Thanks,
> Guojun
>
>