Posted to dev@flink.apache.org by "suganya (JIRA)" <ji...@apache.org> on 2018/01/11 11:15:00 UTC

[jira] [Created] (FLINK-8413) Checkpointing in Flink doesn't maintain the snapshot state

suganya created FLINK-8413:
------------------------------

             Summary: Checkpointing in Flink doesn't maintain the snapshot state
                 Key: FLINK-8413
                 URL: https://issues.apache.org/jira/browse/FLINK-8413
             Project: Flink
          Issue Type: Bug
          Components: State Backends, Checkpointing
    Affects Versions: 1.3.2
            Reporter: suganya


We have a project that consumes events from Kafka, performs a group-by within a time window (5 mins), and, once the window elapses, pushes the events downstream to be merged. The project is deployed on Flink, and we have enabled checkpointing so that it can recover from failures.

(window size: 5 mins, checkpointingInterval: 5 mins, state.backend: filesystem)

Kafka offsets are checkpointed every 5 mins (checkpointingInterval). The event offsets are checkpointed before the entire DAG (groupBy and merge) has finished. So in case of a task manager restart, the new task starts from the last successful checkpoint, but we are not able to get the aggregated snapshot data (the data from the groupBy task) from the persisted checkpoint.

We are able to retrieve the last successfully checkpointed offset from Kafka, but we are not able to get the data that had been aggregated up to that checkpoint.
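
To make the expectation above concrete, here is a minimal sketch in plain Java of what we expect a checkpoint to contain: the source offset and the partial window aggregate, captured together. This is only a toy model, not Flink's API; the names CheckpointSketch, Pipeline, Snapshot, process, checkpoint and restore are hypothetical and exist only for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Toy model (NOT Flink code): a checkpoint that captures the source offset
// together with the in-flight window aggregate. All names are hypothetical.
public class CheckpointSketch {

    /** Copy of everything needed to resume: next offset plus window state. */
    static final class Snapshot {
        final long nextOffset;
        final Map<String, Long> windowCounts;

        Snapshot(long nextOffset, Map<String, Long> windowCounts) {
            this.nextOffset = nextOffset;
            // Defensive copy so later processing cannot change the snapshot.
            this.windowCounts = new HashMap<>(windowCounts);
        }
    }

    /** Stand-in for "consume from Kafka, then group by key within a window". */
    static final class Pipeline {
        private long nextOffset = 0;
        private final Map<String, Long> windowCounts = new HashMap<>();

        /** Consume one event: update the aggregate and advance the offset. */
        void process(String key) {
            windowCounts.merge(key, 1L, Long::sum);
            nextOffset++;
        }

        /** Capture offset and aggregate at the same point in the stream. */
        Snapshot checkpoint() {
            return new Snapshot(nextOffset, windowCounts);
        }

        /** Reset both the offset and the aggregate to the snapshot. */
        void restore(Snapshot s) {
            nextOffset = s.nextOffset;
            windowCounts.clear();
            windowCounts.putAll(s.windowCounts);
        }

        long nextOffset() {
            return nextOffset;
        }

        long count(String key) {
            return windowCounts.getOrDefault(key, 0L);
        }
    }

    public static void main(String[] args) {
        String[] events = {"a", "b", "a", "a", "b"};

        Pipeline running = new Pipeline();
        for (int i = 0; i < 3; i++) {
            running.process(events[i]);
        }
        Snapshot snapshot = running.checkpoint();
        running.process(events[3]); // work done after the checkpoint is lost on failure

        // Simulated restart: a fresh task restores the snapshot, then replays
        // the source from the checkpointed offset.
        Pipeline recovered = new Pipeline();
        recovered.restore(snapshot);
        for (int i = (int) recovered.nextOffset(); i < events.length; i++) {
            recovered.process(events[i]);
        }
        System.out.println("a=" + recovered.count("a") + " b=" + recovered.count("b"));
    }
}
```

In this model the restored task resumes with both the offset and the partial aggregate. What we observe in our deployment is that only the offset appears to be recoverable.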

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)