You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "Mirza Aliev (Jira)" <ji...@apache.org> on 2022/06/24 14:41:00 UTC

[jira] [Comment Edited] (IGNITE-16423) [%my-first-node%JRaft-Common-Executor-1][SnapshotExecutorImpl] Fail to close writer

    [ https://issues.apache.org/jira/browse/IGNITE-16423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17558524#comment-17558524 ] 

Mirza Aliev edited comment on IGNITE-16423 at 6/24/22 2:40 PM:
---------------------------------------------------------------

I've checked several recent logs from {{main}} branch for modules like runner, raft, example, table, and I haven't found failures with the same error. 

Also we have checked the logs and the corresponding places  where such error could be thrown, and we have noticed that the original reason of failures is not preserved. 

So, we decided to backport some improvement in logging of this part from JRaft, so if in the future such problem appears again, we can receive more information about the roots of the problem. Corresponding backport: https://github.com/sofastack/sofa-jraft/commit/e91e660144db852e4a2a62435e6c40a044c4b4cb


was (Author: maliev):
I've checked several recent logs from main for modules like runner, raft, example, table, and I haven't found failures with the same error. 

Also we have checked the logs and the corresponding places  where such error could be thrown, and we have noticed that the original reason of failures is not preserved. 

So, we decided to backport some improvement in logging of this part from JRaft, so if in the future such problem appears again, we can receive more information about the roots of the problem. Corresponding backport: https://github.com/sofastack/sofa-jraft/commit/e91e660144db852e4a2a62435e6c40a044c4b4cb

> [%my-first-node%JRaft-Common-Executor-1][SnapshotExecutorImpl] Fail to close writer
> -----------------------------------------------------------------------------------
>
>                 Key: IGNITE-16423
>                 URL: https://issues.apache.org/jira/browse/IGNITE-16423
>             Project: Ignite
>          Issue Type: Bug
>    Affects Versions: 3.0.0-alpha3
>            Reporter: Fedor Malchikov 
>            Assignee: Sergey Uttsel
>            Priority: Critical
>              Labels: ignite-3
>         Attachments: my-first-node(sql).log, my-first-node.log
>
>
> {noformat}
> 2022-01-28 13:17:41:189 +0300 [INFO][%my-first-node%JRaft-Common-Executor-1][LocalSnapshotStorage] Deleting snapshot /home/prom1se/GG/apache/ignite-3/modules/cli/target/ignite-work/data/my-first-node/metastorage_raft_group_127.0.1.1_3344/snapshot/temp.
> 2022-01-28 13:17:41:192 +0300 [ERROR][%my-first-node%JRaft-Common-Executor-1][SnapshotExecutorImpl] Fail to close writer
> java.io.IOException
> 	at org.apache.ignite.raft.jraft.storage.snapshot.local.LocalSnapshotStorage.close(LocalSnapshotStorage.java:242)
> 	at org.apache.ignite.raft.jraft.storage.snapshot.local.LocalSnapshotWriter.close(LocalSnapshotWriter.java:93)
> 	at org.apache.ignite.raft.jraft.storage.snapshot.local.LocalSnapshotWriter.close(LocalSnapshotWriter.java:88)
> 	at org.apache.ignite.raft.jraft.storage.snapshot.SnapshotExecutorImpl.onSnapshotSaveDone(SnapshotExecutorImpl.java:387)
> 	at org.apache.ignite.raft.jraft.storage.snapshot.SnapshotExecutorImpl$SaveSnapshotDone.continueRun(SnapshotExecutorImpl.java:135)
> 	at org.apache.ignite.raft.jraft.storage.snapshot.SnapshotExecutorImpl$SaveSnapshotDone.lambda$run$0(SnapshotExecutorImpl.java:131)
> 	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
> 	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
> 	at java.base/java.lang.Thread.run(Thread.java:829)
> {noformat}
> And after that, the timeout error repeats every 10 seconds.
> The problem is reproducible stably, but each time needs a different time.  In attachment 2 different tests: one without any activity and one hour wait , second after SQL table creation 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)