You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Jay Kreps (JIRA)" <ji...@apache.org> on 2015/02/08 00:42:35 UTC

[jira] [Resolved] (KAFKA-462) ZK thread crashing doesn't bring down the broker (and doesn't come back up).

     [ https://issues.apache.org/jira/browse/KAFKA-462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jay Kreps resolved KAFKA-462.
-----------------------------
    Resolution: Won't Fix

> ZK thread crashing doesn't bring down the broker (and doesn't come back up).
> ----------------------------------------------------------------------------
>
>                 Key: KAFKA-462
>                 URL: https://issues.apache.org/jira/browse/KAFKA-462
>             Project: Kafka
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 0.7
>            Reporter: Matt Jones
>
> I think the simplest explanation is the traceback. The broker had been up starting at 2012-07-31 18:45:42,951 (based upon the 'Starting Kafka server' log entry), and the error was fixed with a restart of the broker at 2012-08-14 20:59:41,581.
> It looks like zookeeper thread crashed, but the broker kept operating as usual. The expected behavior would be that the zookeeper thread crashing would cause the whole broker to crash, or the zookeeper thread would start itself back up.
> [2012-08-08 01:25:13,398] 624270894 [main-SendThread(zookeeper001:2181)] INFO  org.apache.zookeeper.ClientCnxn  - Client session timed out, have not heard from server in 8749ms for sessionid 0x138e4edc04c1e50, closing socket connection and attempting reconnect
>  [2012-08-08 01:25:15,136] 624272632 [main-EventThread] INFO  org.I0Itec.zkclient.ZkClient  - zookeeper state changed (Disconnected)
>  [2012-08-08 01:25:15,702] 624273198 [main-SendThread(zookeeper001:2181)] INFO  org.apache.zookeeper.ClientCnxn  - Opening socket connection to server zookeeper003/10.125.95.193:2181
>  [2012-08-08 01:25:15,704] 624273200 [main-SendThread(zookeeper003:2181)] INFO  org.apache.zookeeper.ClientCnxn  - Socket connection established to zookeeper003/10.125.95.193:2181, initiating session
>  [2012-08-08 01:25:15,709] 624273205 [main-EventThread] INFO  org.I0Itec.zkclient.ZkClient  - zookeeper state changed (Expired)
>  [2012-08-08 01:25:15,709] 624273205 [main-EventThread] INFO  org.apache.zookeeper.ZooKeeper  - Initiating client connection, connectString=zookeeper001:2181,zookeeper002:2181,zookeeper003:2181 sessionTimeout=6000 watcher=org.I0Itec.zkclient.ZkClient@26d66426
>  [2012-08-08 01:25:21,514] 624279010 [main-SendThread(zookeeper003:2181)] INFO  org.apache.zookeeper.ClientCnxn  - Unable to reconnect to ZooKeeper service, session 0x138e4edc04c1e50 has expired, closing socket connection
>  [2012-08-08 01:25:47,135] 624304631 [main-EventThread] ERROR org.apache.zookeeper.ClientCnxn  - Error while calling watcher 
> 	at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:530)
> 	at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:506)
> Caused by: org.I0Itec.zkclient.exception.ZkException: Unable to connect to zookeeper001:2181,zookeeper002:2181,zookeeper003:2181
> Caused by: java.net.UnknownHostException: zookeeper001
> 	at org.apache.zookeeper.ClientCnxn.<init>(ClientCnxn.java:386)
> 	at org.apache.zookeeper.ClientCnxn.<init>(ClientCnxn.java:331)
> 	at org.apache.zookeeper.ZooKeeper.<init>(ZooKeeper.java:377)
> [2012-08-08 01:25:48,620] 624306116 [main-EventThread] INFO  org.apache.zookeeper.ClientCnxn  - EventThread shut down



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)