You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "David Capwell (Jira)" <ji...@apache.org> on 2020/11/19 19:24:00 UTC
[jira] [Comment Edited] (CASSANDRA-15214) Internode messaging
catches OOMs and does not rethrow
[ https://issues.apache.org/jira/browse/CASSANDRA-15214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235604#comment-17235604 ]
David Capwell edited comment on CASSANDRA-15214 at 11/19/20, 7:23 PM:
----------------------------------------------------------------------
CI Results: Yellow. Expected COMPACT STORAGE upgrade test that Alex is looking into, and org.apache.cassandra.distributed.test.SimpleReadWriteTest, though that test passes when I reran locally
||Branch||Source||Circle CI||Jenkins||
|trunk|[branch|https://github.com/dcapwell/cassandra/tree/commit_remote_branch/CASSANDRA-15214-trunk-C01561D4-BCE5-4B0B-B8F3-4D57E308657A]|[build|https://app.circleci.com/pipelines/github/dcapwell/cassandra?branch=commit_remote_branch%2FCASSANDRA-15214-trunk-C01561D4-BCE5-4B0B-B8F3-4D57E308657A]|[build|https://ci-cassandra.apache.org/job/Cassandra-devbranch/221/]|
was (Author: dcapwell):
Starting commit
CI Results (pending):
||Branch||Source||Circle CI||Jenkins||
|trunk|[branch|https://github.com/dcapwell/cassandra/tree/commit_remote_branch/CASSANDRA-15214-trunk-C01561D4-BCE5-4B0B-B8F3-4D57E308657A]|[build|https://app.circleci.com/pipelines/github/dcapwell/cassandra?branch=commit_remote_branch%2FCASSANDRA-15214-trunk-C01561D4-BCE5-4B0B-B8F3-4D57E308657A]|[build|https://ci-cassandra.apache.org/job/Cassandra-devbranch/221/]|
> Internode messaging catches OOMs and does not rethrow
> -----------------------------------------------------
>
> Key: CASSANDRA-15214
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15214
> Project: Cassandra
> Issue Type: Bug
> Components: Messaging/Client, Messaging/Internode
> Reporter: Benedict Elliott Smith
> Assignee: Yifan Cai
> Priority: Normal
> Fix For: 4.0-beta4
>
> Attachments: oom-experiments.zip
>
> Time Spent: 1h 40m
> Remaining Estimate: 0h
>
> Netty (at least, and perhaps elsewhere in Executors) catches all exceptions, so presently there is no way to ensure that an OOM reaches the JVM handler to trigger a crash/heapdump.
> It may be that the simplest most consistent way to do this would be to have a single thread spawned at startup that waits for any exceptions we must propagate to the Runtime.
> We could probably submit a patch upstream to Netty, but for a guaranteed future proof approach, it may be worth paying the cost of a single thread.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@cassandra.apache.org
For additional commands, e-mail: commits-help@cassandra.apache.org