Posted to issues@flink.apache.org by "Till Rohrmann (JIRA)" <ji...@apache.org> on 2018/10/18 08:20:00 UTC

[jira] [Commented] (FLINK-10557) Checkpoint size metric incorrectly reports the same value until restart

    [ https://issues.apache.org/jira/browse/FLINK-10557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16654848#comment-16654848 ] 

Till Rohrmann commented on FLINK-10557:
---------------------------------------

Have you checked whether the same problem occurs with Flink 1.5 or 1.6 [~randrews]?

> Checkpoint size metric incorrectly reports the same value until restart
> -----------------------------------------------------------------------
>
>                 Key: FLINK-10557
>                 URL: https://issues.apache.org/jira/browse/FLINK-10557
>             Project: Flink
>          Issue Type: Bug
>          Components: Metrics
>    Affects Versions: 1.4.0
>            Reporter: Ruby Andrews
>            Priority: Major
>
> We have seen the following several times, but have not found the root cause.
> The checkpoint size metric will sometimes report the same value over and over, even though the checkpoint size is changing. The last time we saw this, it persisted for 4 days, until we restarted the Flink cluster. During that period, the application flushed all data each day, so we would expect the checkpoint size to grow until UTC midnight, then drop to about 0 and begin growing again.
> It appears that the metrics continue to be gathered, because we see them in the data repository we report them to. However, the size does not change.
> Is there more information we can gather to find the root cause if it happens again?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)