Posted to jira@kafka.apache.org by "James Cheng (JIRA)" <ji...@apache.org> on 2018/07/10 05:13:00 UTC

[jira] [Created] (KAFKA-7144) Kafka Streams doesn't properly balance partition assignment

James Cheng created KAFKA-7144:
----------------------------------

             Summary: Kafka Streams doesn't properly balance partition assignment
                 Key: KAFKA-7144
                 URL: https://issues.apache.org/jira/browse/KAFKA-7144
             Project: Kafka
          Issue Type: Bug
          Components: streams
    Affects Versions: 1.1.0
            Reporter: James Cheng
         Attachments: OneThenTwelve.java

Kafka Streams doesn't always spread the tasks across all available instances/threads.

I have a topology that consumes a single-partition topic and goes .through() a 12-partition topic. That makes 13 partitions in total.

I then started 2 instances of the application. I would have expected the 13 partitions to be split across the 2 instances roughly evenly (7 partitions on one, 6 partitions on the other).

Instead, one instance gets 12 partitions, and the other instance gets 1 partition.
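
To make the expected numbers concrete, here is a minimal sketch of the arithmetic behind the 7/6 split. It is purely illustrative: the class and method names are hypothetical, and it deals partitions out round-robin, which is an assumption about what "roughly even" means and not how the Kafka Streams assignor actually works.

```java
import java.util.Arrays;

// Hypothetical illustration only: this is NOT the Kafka Streams assignor.
// It just shows the even split the report expected to see.
class EvenSplitSketch {

    // Deal `partitions` out to `instances` one at a time, round-robin,
    // and return how many partitions each instance ends up with.
    static int[] distribute(int partitions, int instances) {
        int[] counts = new int[instances];
        for (int p = 0; p < partitions; p++) {
            counts[p % instances]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        int total = 1 + 12; // topic "one" (1 partition) + topic "twelve" (12 partitions)
        System.out.println(Arrays.toString(distribute(total, 2))); // prints [7, 6]
    }
}
```

The observed assignment was [12, 1] instead, which is the imbalance this issue reports.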

Repro case attached. I ran it a couple of times, and the result was fairly repeatable.

Setup for the repro:
{code:java}
$ ./bin/kafka-topics.sh --zookeeper localhost --create --topic one --partitions 1 --replication-factor 1 
$ ./bin/kafka-topics.sh --zookeeper localhost --create --topic twelve --partitions 12 --replication-factor 1
$ echo foo | kafkacat -P -b 127.0.0.1 -t one
{code}
