You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Joe Stein (JIRA)" <ji...@apache.org> on 2014/08/10 10:05:12 UTC

[jira] [Commented] (KAFKA-1585) Client: Infinite "conflict in /consumers/"

    [ https://issues.apache.org/jira/browse/KAFKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14092050#comment-14092050 ] 

Joe Stein commented on KAFKA-1585:
----------------------------------

FWIW there were a lot of bug fixes released in the Zookeeper 3.4.6 http://zookeeper.apache.org/doc/r3.4.6/releasenotes.html from 3.4.5 version.

You could be hitting ZOOKEEPER-1382 which was fixed in the 3.4.6 release

Current Kafka 0.8.1.1 zookeeper recommend https://kafka.apache.org/documentation.html#zk though folks are using 3.4.6 in production and that should be Zookeeper version for 0.8.2

In regards to your logs, before this happened it looks like you had errors and then a reconnect and consumer shutdown

Line 132356: 18:31:38,948 [7-cloudera:2181] INFO  kafka.utils.Logging$class - [Q_dev-1407608193903-1cb30b18], Q_dev-1407608193903-1cb30b18-0 attempting to claim partition 0
Line 132357: 18:31:38,975 [26-d7f0e66a-0-0] ERROR kafka.utils.Logging$class - [ConsumerFetcherThread-Q_dev-1407608195226-d7f0e66a-0-0], Current offset 15 for partition [gk.q.event,0] out of range; reset offset to 0
Line 132358: 18:31:38,980 [62-1d81f64b-0-0] ERROR kafka.utils.Logging$class - [ConsumerFetcherThread-Q_dev-1407608193962-1d81f64b-0-0], Current offset 4 for partition [gk.q.mail.api,0] out of range; reset offset to 0
Line 132359: 18:31:38,994 [84-ceea5788-0-0] WARN  kafka.utils.Logging$class - Reconnect due to socket error: null
Line 132360: 18:31:38,995 [84-ceea5788-0-0] INFO  kafka.utils.Logging$class - [ConsumerFetcherThread-dev_dev-1407608194884-ceea5788-0-0], Stopped 
Line 132361: 18:31:38,995 [atcher_executor] INFO  kafka.utils.Logging$class - [ConsumerFetcherThread-dev_dev-1407608194884-ceea5788-0-0], Shutdown completed
Line 132362: 18:31:38,995 [atcher_executor] INFO  kafka.utils.Logging$class - [ConsumerFetcherManager-1407608194890] All connections stopped
Line 132363: 18:31:38,996 [atcher_executor] INFO  kafka.utils.Logging$class - [dev_dev-1407608194884-ceea5788], Cleared all relevant queues for this fetcher
Line 132364: 18:31:38,996 [atcher_executor] INFO  kafka.utils.Logging$class - [dev_dev-1407608194884-ceea5788], Cleared the data chunks in all the consumer message iterators
Line 132365: 18:31:38,996 [atcher_executor] INFO  kafka.utils.Logging$class - [dev_dev-1407608194884-ceea5788], Committing all offsets after clearing the fetcher queues
Line 132366: 18:31:38,996 [atcher_executor] INFO  kafka.utils.Logging$class - [dev_dev-1407608194884-ceea5788], Releasing partition ownership
Line 132367: 18:31:39,005 [7-cloudera:2181] INFO  kafka.utils.Logging$class - conflict in /consumers/Q/owners/gk.q.log/0 data: Q_dev-1407608193903-1cb30b18-0 stored data: Q_dev-1407608205503-9cfb99aa-0

likely what happened is when it reconnected the timeout with zk never occurred and it got stuck there.  Could be the Zk bug, could also be related somewhat to KAFKA-1387 or KAFKA-1451 I will link the JIRAs so when we test 0.8.2 see about reproducing this on a good zk version

To resolve that you can stop the consumer, wait for the zk nodes to expire and start up the consumers again.



> Client: Infinite "conflict in /consumers/"
> ------------------------------------------
>
>                 Key: KAFKA-1585
>                 URL: https://issues.apache.org/jira/browse/KAFKA-1585
>             Project: Kafka
>          Issue Type: Bug
>          Components: consumer
>    Affects Versions: 0.8.1.1
>            Reporter: Artur Denysenko
>            Priority: Critical
>             Fix For: 0.8.2
>
>         Attachments: kafka_consumer_ephemeral_node_extract.zip
>
>
> Periodically we have kafka consumers cycling in "conflict in /consumers/" and "I wrote this conflicted ephemeral node". 
> Please see attached log extract.
> After restarting the process kafka consumers are working perfectly. 
> We are using Zookeeper 3.4.5



--
This message was sent by Atlassian JIRA
(v6.2#6252)