You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Yunfeng Zhou (Jira)" <ji...@apache.org> on 2022/07/26 06:32:00 UTC

[jira] [Commented] (FLINK-28684) NullPointerException at OneHotEncoder.GenerateModelDataOperator.snapshot

    [ https://issues.apache.org/jira/browse/FLINK-28684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17571221#comment-17571221 ] 

Yunfeng Zhou commented on FLINK-28684:
--------------------------------------

This problem occurs when a checkpoint barrier reaches a `GenerateModelDataOperator` before the first `StreamRecord` arrives at the operator. In this case, the `maxIndices` variable is not initialized and thus a `null` value is attempted to be stored in the snapshot. The solution is to provide a proper initial value for this variable.

> NullPointerException at OneHotEncoder.GenerateModelDataOperator.snapshot
> ------------------------------------------------------------------------
>
>                 Key: FLINK-28684
>                 URL: https://issues.apache.org/jira/browse/FLINK-28684
>             Project: Flink
>          Issue Type: New Feature
>          Components: Library / Machine Learning
>    Affects Versions: ml-2.1.0, ml-2.2.0
>            Reporter: Zhipeng Zhang
>            Assignee: Yunfeng Zhou
>            Priority: Major
>
> E Caused by: org.apache.flink.runtime.checkpoint.CheckpointException: Could not complete snapshot 1 for operator GenerateModelDataOperator -> *anonymous_datastream_source$215*[229] -> ConstraintEnforcer[230] -> TableToDataSteam (1/1)#0. Failure reason: Checkpoint was declined. 
> [193|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:194]E at org.apache.flink.streaming.api.operators.StreamOperatorStateHandler.snapshotState(StreamOperatorStateHandler.java:269) 
> [194|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:195]E at org.apache.flink.streaming.api.operators.StreamOperatorStateHandler.snapshotState(StreamOperatorStateHandler.java:173) 
> [195|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:196]E at org.apache.flink.streaming.api.operators.AbstractStreamOperator.snapshotState(AbstractStreamOperator.java:348) 
> [196|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:197]E at org.apache.flink.streaming.runtime.tasks.RegularOperatorChain.checkpointStreamOperator(RegularOperatorChain.java:227) 
> [197|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:198]E at org.apache.flink.streaming.runtime.tasks.RegularOperatorChain.buildOperatorSnapshotFutures(RegularOperatorChain.java:212) 
> [198|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:199]E at org.apache.flink.streaming.runtime.tasks.RegularOperatorChain.snapshotState(RegularOperatorChain.java:192) 
> [199|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:200]E at org.apache.flink.streaming.runtime.tasks.SubtaskCheckpointCoordinatorImpl.takeSnapshotSync(SubtaskCheckpointCoordinatorImpl.java:647) 
> [200|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:201]E at org.apache.flink.streaming.runtime.tasks.SubtaskCheckpointCoordinatorImpl.checkpointState(SubtaskCheckpointCoordinatorImpl.java:320) 
> [201|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:202]E at org.apache.flink.streaming.runtime.tasks.StreamTask.lambda$performCheckpoint$12(StreamTask.java:1253) 
> [202|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:203]E at org.apache.flink.streaming.runtime.tasks.StreamTaskActionExecutor$1.runThrowing(StreamTaskActionExecutor.java:50) 
> [203|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:204]E at org.apache.flink.streaming.runtime.tasks.StreamTask.performCheckpoint(StreamTask.java:1241) 
> [204|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:205]E at org.apache.flink.streaming.runtime.tasks.StreamTask.triggerCheckpointOnBarrier(StreamTask.java:1198) 
> [205|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:206]E ... 22 more 
> [206|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:207]E Caused by: java.lang.NullPointerException 
> [207|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:208]E at org.apache.flink.api.common.typeutils.base.GenericArraySerializer.copy(GenericArraySerializer.java:92) 
> [208|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:209]E at org.apache.flink.api.common.typeutils.base.GenericArraySerializer.copy(GenericArraySerializer.java:37) 
> [209|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:210]E at org.apache.flink.runtime.state.ArrayListSerializer.copy(ArrayListSerializer.java:75) 
> [210|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:211]E at org.apache.flink.runtime.state.PartitionableListState.<init>(PartitionableListState.java:65) 
> [211|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:212]E at org.apache.flink.runtime.state.PartitionableListState.deepCopy(PartitionableListState.java:79) 
> [212|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:213]E at org.apache.flink.runtime.state.DefaultOperatorStateBackendSnapshotStrategy.syncPrepareResources(DefaultOperatorStateBackendSnapshotStrategy.java:77) 
> [213|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:214]E at org.apache.flink.runtime.state.DefaultOperatorStateBackendSnapshotStrategy.syncPrepareResources(DefaultOperatorStateBackendSnapshotStrategy.java:36) 
> [214|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:215]E at org.apache.flink.runtime.state.SnapshotStrategyRunner.snapshot(SnapshotStrategyRunner.java:77) 
> [215|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:216]E at org.apache.flink.runtime.state.DefaultOperatorStateBackend.snapshot(DefaultOperatorStateBackend.java:230) 
> [216|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:217]E at org.apache.flink.streaming.api.operators.StreamOperatorStateHandler.snapshotState(StreamOperatorStateHandler.java:230) 
> [217|https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true#step:9:218]E ... 33 more
>  
>  
> https://github.com/apache/flink-ml/runs/7512316012?check_suite_focus=true



--
This message was sent by Atlassian Jira
(v8.20.10#820010)