You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "kaushik srinivas (JIRA)" <ji...@apache.org> on 2018/02/27 17:45:00 UTC

[jira] [Created] (KAFKA-6600) Kafka Bytes Out lags behind Kafka Bytes In on all brokers when topics replicated with 3 and flume kafka consumer.

kaushik srinivas created KAFKA-6600:
---------------------------------------

             Summary: Kafka Bytes Out lags behind Kafka Bytes In on all brokers when topics replicated with 3 and flume kafka consumer.
                 Key: KAFKA-6600
                 URL: https://issues.apache.org/jira/browse/KAFKA-6600
             Project: Kafka
          Issue Type: Bug
    Affects Versions: 0.10.0.1
            Reporter: kaushik srinivas


Below is the setup detail,

Kafka with 3 brokers (each broker with 10 cores and 32GBmem (12 GB heap)).

Created topic with 120 partitions and replication factor 3.

Throughput per broker is ~40k msgs/sec and bytes in ~8mb/sec.

Flume kafka source is used as the consumer.

Observations:

When the replication factor is kept 1, the bytes out and bytes in stops exactly at same timestamp(i.e when the producer to kafka is stopped).

But when the replication factor is increased to 3, there is a time lag observed in bytes out compared to bytes in. Flume kafka source is pulling data slowly. But flume is configured with very high memory and cpu configurations.

 

Tried increasing num.replica.fetchers from default value 1 to 10, 20, 50 etc and replica.fetch.max.bytes from default 1MB to 10MB,20MB. But no improvement is found to be observed in terms of the lag.

under repplicated partitions is observed to be zero using replica manager metrics in jmx.

Kafka brokers were monitored for cpu and memory, cpu is being used at 3% of total cores max and memory used at 4gb (32 Gb configured).

Flume kafka source has overriden kafka consumer properties : max.partition.fetch bytes is kept at default 1MB and fetch.max.bytes is kept at default 52MB. Flume kafka source batch size is kept at default value 1000.
 agent.sources.****.kafka.consumer.fetch.max.bytes = 10485760
 agent.sources.****.kafka.consumer.max.partition.fetch.bytes = 10485760
 agent.sources.****.batchSize = 1000
 

what more tuning is needed in order to reduce the lag between bytes in and bytes out  at kafka brokers with replication factor 3 or is there any configuration missed out?

 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)