You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Aride Chettali (JIRA)" <ji...@apache.org> on 2016/06/01 12:57:59 UTC
[jira] [Issue Comment Deleted] (FLINK-4000) Exception: Could not
restore checkpointed state to operators and functions; during Job Restart
(Job restart is triggered due to one of the task manager failure)
[ https://issues.apache.org/jira/browse/FLINK-4000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Aride Chettali updated FLINK-4000:
----------------------------------
Comment: was deleted
(was: I was using the default state check-points (in-memory).
And I have figured out the issue. The issue was user function did not implement the "Checkpointed" interface. Once I implemented the functions _snapshotState_ and _restoreState_ then my job got restarted successfully.
Thanks for your help..)
> Exception: Could not restore checkpointed state to operators and functions; during Job Restart (Job restart is triggered due to one of the task manager failure)
> -----------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: FLINK-4000
> URL: https://issues.apache.org/jira/browse/FLINK-4000
> Project: Flink
> Issue Type: Bug
> Components: DataStream API
> Affects Versions: 1.0.3
> Environment: //Fault Tolerance Configuration of the Job
> env.enableCheckpointing(5000); env.getCheckpointConfig().setCheckpointingMode(CheckpointingMode.EXACTLY_ONCE);
> env.getCheckpointConfig().setMaxConcurrentCheckpoints(1);
> env.setRestartStrategy(RestartStrategies.fixedDelayRestart( 3,10000));
> Reporter: Aride Chettali
>
> java.lang.Exception: Could not restore checkpointed state to operators and functions
> at org.apache.flink.streaming.runtime.tasks.StreamTask.restoreState(StreamTask.java:457)
> at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:209)
> at org.apache.flink.runtime.taskmanager.Task.run(Task.java:559)
> at java.lang.Thread.run(Thread.java:745)
> Caused by: java.lang.Exception: Failed to restore state to function: null
> at org.apache.flink.streaming.api.operators.AbstractUdfStreamOperator.restoreState(AbstractUdfStreamOperator.java:168)
> at org.apache.flink.streaming.runtime.tasks.StreamTask.restoreState(StreamTask.java:449)
> ... 3 more
> Caused by: java.lang.NullPointerException
> at org.apache.flink.streaming.api.functions.source.MessageAcknowledgingSourceBase.restoreState(MessageAcknowledgingSourceBase.java:184)
> at org.apache.flink.streaming.api.functions.source.MessageAcknowledgingSourceBase.restoreState(MessageAcknowledgingSourceBase.java:80)
> at org.apache.flink.streaming.api.operators.AbstractUdfStreamOperator.restoreState(AbstractUdfStreamOperator.java:165)
> ... 4 more
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)