You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Sophie Blee-Goldman (Jira)" <ji...@apache.org> on 2019/12/06 19:00:00 UTC
[jira] [Created] (KAFKA-9282) Consider more flexible node grouping
for Pattern subscription
Sophie Blee-Goldman created KAFKA-9282:
------------------------------------------
Summary: Consider more flexible node grouping for Pattern subscription
Key: KAFKA-9282
URL: https://issues.apache.org/jira/browse/KAFKA-9282
Project: Kafka
Issue Type: Improvement
Components: streams
Reporter: Sophie Blee-Goldman
The current grouping for input topics using pattern subscription creates a single node group for all matching topics, meaning the number of tasks scales with the maximum partition count across all topics. This reduces overhead and is suitable for some scenarios, but limits the ability to scale out and prevents easily parallelized processing of completely independent partitions. We should consider making it possible for the number of tasks to instead scale with the total number of partitions summed across all matching input topics.
Ideally Streams could just autoscale based on some heuristic and the currently available resources, but we would have to be careful if those things change. Alternatively we could just leave this up to the user to decide, potentially by augmenting the Pattern-based source KStream method with a new overload allowing this grouping to be specified. For example
{code:java}
StreamsBuilder {
public KStream stream(topicPattern, numTasks);
}
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)