Posted to dev@zookeeper.apache.org by "Vishal K (JIRA)" <ji...@apache.org> on 2010/11/30 20:40:11 UTC
[jira] Commented: (ZOOKEEPER-883) Idle cluster increasingly consumes CPU resources
[ https://issues.apache.org/jira/browse/ZOOKEEPER-883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965374#action_12965374 ]
Vishal K commented on ZOOKEEPER-883:
------------------------------------
Hi,
If this problem is reproducible, can you try the patch attached to ZOOKEEPER-880? Going by the logs, that patch will fix only part of the problem: it will prevent the leak of SendWorker threads.
The logs show 320 RecvWorker threads blocked in a read() on a remote socket. Two of these should be legitimate threads connected to the remote servers. The rest are very likely from Nagios.
I can think of two cases that could lead to this situation:
1. Nagios may not be closing its connections. If Nagios were closing them, the read() would have received an exception.
2. The monitoring frequency may be set too high.
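To illustrate case 1, here is a minimal, self-contained sketch of how a blocking read behaves when the peer closes its socket versus when it leaves the connection open. This is not ZooKeeper code: the class name BlockedReadSketch, the readOnce helper, and the 500 ms timeout are assumptions made up for the example, and it uses a plain loopback socket rather than the election port. The read timeout is there only so the example terminates; a read without one would block indefinitely, which is what the stuck threads look like.

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.net.SocketTimeoutException;

public class BlockedReadSketch {

    /** Outcome of one blocking read attempt on an accepted connection. */
    enum Outcome { DATA, PEER_CLOSED, TIMED_OUT }

    /**
     * Opens a loopback connection, then blocks in readInt() on the accepted
     * side for at most timeoutMs. The clientCloses flag controls whether the
     * simulated client (standing in for a port check) closes its socket.
     */
    static Outcome readOnce(boolean clientCloses, int timeoutMs) throws IOException {
        try (ServerSocket server = new ServerSocket(0, 1, InetAddress.getLoopbackAddress());
             Socket client = new Socket(server.getInetAddress(), server.getLocalPort());
             Socket accepted = server.accept()) {
            // Bound the blocking read; without a timeout it would wait forever.
            accepted.setSoTimeout(timeoutMs);
            if (clientCloses) {
                client.close(); // peer closes cleanly, so the reader sees end-of-stream
            }
            DataInputStream in = new DataInputStream(accepted.getInputStream());
            try {
                in.readInt();
                return Outcome.DATA;
            } catch (EOFException e) {
                // The peer closed the connection: the blocked read is released.
                return Outcome.PEER_CLOSED;
            } catch (SocketTimeoutException e) {
                // The peer kept the connection open and sent nothing.
                return Outcome.TIMED_OUT;
            }
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(readOnce(true, 500));  // PEER_CLOSED
        System.out.println(readOnce(false, 500)); // TIMED_OUT
    }
}
```

If the checker closes its side, the reader is released with an exception right away. If it does not, the reader stays in read() until something else (here, the timeout) interrupts it.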
-Vishal
> Idle cluster increasingly consumes CPU resources
> ------------------------------------------------
>
> Key: ZOOKEEPER-883
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-883
> Project: ZooKeeper
> Issue Type: Bug
> Components: server
> Affects Versions: 3.3.1
> Reporter: Lars George
> Attachments: Archive.zip
>
>
> Monitoring the ZooKeeper nodes by polling the various ports with Nagios' open port checks seems to cause a substantial rise in the CPU used by the ZooKeeper daemons. Over the course of a week, an idle cluster grew from a baseline of 2% to >10% CPU usage. Attached are a stack dump and logs showing the occupied threads. In the end, the daemon starts failing with "too many open files" errors as all handles are used up.