You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Aride Chettali (JIRA)" <ji...@apache.org> on 2016/06/01 12:57:59 UTC

[jira] [Issue Comment Deleted] (FLINK-4000) Exception: Could not restore checkpointed state to operators and functions; during Job Restart (Job restart is triggered due to one of the task manager failure)

     [ https://issues.apache.org/jira/browse/FLINK-4000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aride Chettali updated FLINK-4000:
----------------------------------
    Comment: was deleted

(was: I was using the default state check-points (in-memory).  
And I have figured out the issue. The issue was user function did not implement the "Checkpointed" interface. Once I implemented the functions _snapshotState_  and _restoreState_ then my job got restarted successfully.
 
Thanks for your help..)

> Exception: Could not restore checkpointed state to operators and functions;  during Job Restart (Job restart is triggered due to one of the task manager failure)
> -----------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-4000
>                 URL: https://issues.apache.org/jira/browse/FLINK-4000
>             Project: Flink
>          Issue Type: Bug
>          Components: DataStream API
>    Affects Versions: 1.0.3
>         Environment: //Fault Tolerance Configuration of the Job
> env.enableCheckpointing(5000);     env.getCheckpointConfig().setCheckpointingMode(CheckpointingMode.EXACTLY_ONCE);
> env.getCheckpointConfig().setMaxConcurrentCheckpoints(1);
> env.setRestartStrategy(RestartStrategies.fixedDelayRestart( 3,10000));
>            Reporter: Aride Chettali
>
> java.lang.Exception: Could not restore checkpointed state to operators and functions
> 	at org.apache.flink.streaming.runtime.tasks.StreamTask.restoreState(StreamTask.java:457)
> 	at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:209)
> 	at org.apache.flink.runtime.taskmanager.Task.run(Task.java:559)
> 	at java.lang.Thread.run(Thread.java:745)
> Caused by: java.lang.Exception: Failed to restore state to function: null
> 	at org.apache.flink.streaming.api.operators.AbstractUdfStreamOperator.restoreState(AbstractUdfStreamOperator.java:168)
> 	at org.apache.flink.streaming.runtime.tasks.StreamTask.restoreState(StreamTask.java:449)
> 	... 3 more
> Caused by: java.lang.NullPointerException
> 	at org.apache.flink.streaming.api.functions.source.MessageAcknowledgingSourceBase.restoreState(MessageAcknowledgingSourceBase.java:184)
> 	at org.apache.flink.streaming.api.functions.source.MessageAcknowledgingSourceBase.restoreState(MessageAcknowledgingSourceBase.java:80)
> 	at org.apache.flink.streaming.api.operators.AbstractUdfStreamOperator.restoreState(AbstractUdfStreamOperator.java:165)
> 	... 4 more



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)