You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Sophie Blee-Goldman (Jira)" <ji...@apache.org> on 2019/12/06 19:00:00 UTC

[jira] [Created] (KAFKA-9282) Consider more flexible node grouping for Pattern subscription

Sophie Blee-Goldman created KAFKA-9282:
------------------------------------------

             Summary: Consider more flexible node grouping for Pattern subscription
                 Key: KAFKA-9282
                 URL: https://issues.apache.org/jira/browse/KAFKA-9282
             Project: Kafka
          Issue Type: Improvement
          Components: streams
            Reporter: Sophie Blee-Goldman


The current grouping for input topics using pattern subscription creates a single node group for all matching topics, meaning the number of tasks scales with the maximum partition count across all topics. This reduces overhead and is suitable for some scenarios, but limits the ability to scale out and prevents easily parallelized processing of completely independent partitions. We should consider making it possible for the number of tasks to instead scale with the total number of partitions summed across all matching input topics.

Ideally Streams could just autoscale based on some heuristic and the currently available resources, but we would have to be careful if those things change. Alternatively we could just leave this up to the user to decide, potentially by augmenting the Pattern-based source KStream method with a new overload allowing this grouping to be specified. For example
{code:java}
StreamsBuilder {
  public KStream stream(topicPattern, numTasks);
}
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)