You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Yun Tang (Jira)" <ji...@apache.org> on 2019/12/16 12:48:00 UTC

[jira] [Commented] (FLINK-15280) Checkpoint end-to-end test failed

    [ https://issues.apache.org/jira/browse/FLINK-15280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16997245#comment-16997245 ] 

Yun Tang commented on FLINK-15280:
----------------------------------

[https://api.travis-ci.org/v3/job/624983426/log.txt] fails due to error:
{code:java}
2019-12-14 20:58:51,531 WARN  org.apache.flink.streaming.runtime.tasks.StreamTask           - Error while canceling task.
java.lang.IllegalStateException: Released
	at org.apache.flink.runtime.io.network.partition.consumer.SingleInputGate.getNextBufferOrEvent(SingleInputGate.java:483)
	at org.apache.flink.runtime.io.network.partition.consumer.SingleInputGate.pollNext(SingleInputGate.java:474)
	at org.apache.flink.runtime.taskmanager.InputGateWithMetrics.pollNext(InputGateWithMetrics.java:75)
	at org.apache.flink.streaming.runtime.io.CheckpointedInputGate.pollNext(CheckpointedInputGate.java:125)
	at org.apache.flink.streaming.runtime.io.StreamTaskNetworkInput.emitNext(StreamTaskNetworkInput.java:133)
	at org.apache.flink.streaming.runtime.io.StreamOneInputProcessor.processInput(StreamOneInputProcessor.java:69)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.processInput(StreamTask.java:311)
	at org.apache.flink.streaming.runtime.tasks.mailbox.MailboxProcessor.runMailboxLoop(MailboxProcessor.java:187)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.runMailboxLoop(StreamTask.java:488)
	at org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:470)
	at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:702)
	at org.apache.flink.runtime.taskmanager.Task.run(Task.java:527)
	at java.lang.Thread.run(Thread.java:748)
{code}
I think this exception should be considered as one of the white list.

However, another instance [https://api.travis-ci.org/v3/job/625281873/log.txt] fails due to some unknown error which is inundated due to maximum length limit.

 

To avoid unknown errors, I think we should either print errors out once grepped or avoid to cat metrics out. CC [~trohrmann] , [~liyu]

> Checkpoint end-to-end test failed
> ---------------------------------
>
>                 Key: FLINK-15280
>                 URL: https://issues.apache.org/jira/browse/FLINK-15280
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Checkpointing
>    Affects Versions: 1.10.0
>            Reporter: wangxiyuan
>            Priority: Major
>
> The Checkpoint e2e test failed. The error log is very long. Please check the link to find the error message.
> Some logs in these two days:
> [https://api.travis-ci.org/v3/job/624983426/log.txt]
> Running 'Resuming Externalized Checkpoint (rocks, incremental, no parallelism change) end-to-end test'
>  
> [https://api.travis-ci.org/v3/job/625281873/log.txt]
> Running 'Resuming Externalized Checkpoint (file, sync, scale down) end-to-end test'
>  
> And arm as well:
> [https://logs.openlabtesting.org/logs/periodic-20-flink-mail/github.com/apache/flink/master/flink-end-to-end-test-arm64-checkpoints-and-sticky/3da575a/job-output.txt.gz]
> Running 'Resuming Externalized Checkpoint (rocks, incremental, no parallelism change) end-to-end test'
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)