Posted to dev@kafka.apache.org by "Lei Wang (JIRA)" <ji...@apache.org> on 2016/07/18 18:56:20 UTC

[jira] [Commented] (KAFKA-3971) Consumers drop from coordinator and cannot reconnect

    [ https://issues.apache.org/jira/browse/KAFKA-3971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15382833#comment-15382833 ] 

Lei Wang commented on KAFKA-3971:
---------------------------------

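Consumer config:
{noformat}
    // Imports needed by this snippet
    import java.util.Arrays;
    import java.util.Properties;
    import org.apache.kafka.clients.consumer.KafkaConsumer;

    Properties props = new Properties();
    props.put("bootstrap.servers", "<the server list>");
    // The same string (clientId + UID) is used as both group.id and client.id
    String me = clientId + UID;
    props.put("group.id", me);
    props.put("client.id", me);
    // Start from the earliest offset when no committed offset exists
    props.put("auto.offset.reset", "earliest");
    // Auto-commit offsets every second
    props.put("enable.auto.commit", "true");
    props.put("auto.commit.interval.ms", "1000");
    props.put("session.timeout.ms", "30000");
    // Keys are longs, values are raw bytes
    props.put("key.deserializer", "org.apache.kafka.common.serialization.LongDeserializer");
    props.put("value.deserializer", "org.apache.kafka.common.serialization.ByteArrayDeserializer");

    consumer = new KafkaConsumer<>(props);
    consumer.subscribe(Arrays.asList(topic));
{noformat}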

> Consumers drop from coordinator and cannot reconnect
> ----------------------------------------------------
>
>                 Key: KAFKA-3971
>                 URL: https://issues.apache.org/jira/browse/KAFKA-3971
>             Project: Kafka
>          Issue Type: Bug
>          Components: consumer
>    Affects Versions: 0.9.0.1
>         Environment: version 0.9.0.1
>            Reporter: Lei Wang
>         Attachments: KAFKA-3971.txt
>
>
> From time to time we create new topics, and all consumers pick them up. When they start consuming from these new topics, we often see that some random consumers cannot connect to the coordinator. The log is flooded with the following message, tens of thousands of times every second:
> {noformat}
> 16/07/18 18:18:36.003 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.004 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> 16/07/18 18:18:36.005 INFO (AbstractCoordinator.java:529): Marking the coordinator 2147483645 dead.
> {noformat}
> The servers seem to be working fine, and the other consumers are unaffected.
> From the log, it looks like the consumer retries multiple times every millisecond, and every attempt fails.
> The same process consumes from many topics; some of them still work well, but those random topics fail.
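
A note on reading the coordinator id in the log above: as far as I know, the Java consumer in this line of releases registers its coordinator connection under a synthetic node id computed as Integer.MAX_VALUE minus the real broker id, so that it does not collide with the regular connection to the same broker. Under that assumption (it describes client internals, not documented behavior, and the class and method names below are made up for illustration), the id can be mapped back to a broker like this:

```java
// Sketch only: assumes the coordinator id in the log is Integer.MAX_VALUE - brokerId.
// CoordinatorIdDecoder and brokerIdFromCoordinatorId are hypothetical names, not Kafka APIs.
class CoordinatorIdDecoder {

    // Reverse the assumed mapping to recover the underlying broker id.
    static int brokerIdFromCoordinatorId(int coordinatorId) {
        return Integer.MAX_VALUE - coordinatorId;
    }

    public static void main(String[] args) {
        int idFromLog = 2147483645; // value taken from the "Marking the coordinator ... dead" lines
        System.out.println("broker id = " + brokerIdFromCoordinatorId(idFromLog));
    }
}
```

If the assumption holds, the coordinator being marked dead here is broker 2, which may help narrow down which broker's logs to check.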


