You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Alexey Romanenko (Jira)" <ji...@apache.org> on 2019/09/02 13:19:00 UTC

[jira] [Commented] (BEAM-8121) Messages are not distributed per machines when consuming from Kafka topic with 1 partition

    [ https://issues.apache.org/jira/browse/BEAM-8121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16920851#comment-16920851 ] 

Alexey Romanenko commented on BEAM-8121:
----------------------------------------

I guess it could be caused by fusion when messages, consumed by the same consumer, will be processed on the same worker.   Which runner do you use? Did you try to use {{Reshuffle.viaRandomKey()}} to break a fusion?

> Messages are not distributed per machines when consuming from Kafka topic with 1 partition
> ------------------------------------------------------------------------------------------
>
>                 Key: BEAM-8121
>                 URL: https://issues.apache.org/jira/browse/BEAM-8121
>             Project: Beam
>          Issue Type: Bug
>          Components: io-java-kafka
>    Affects Versions: 2.14.0
>            Reporter: TJ
>            Priority: Major
>
> Messages are consumed from Kafka using KafkaIO. Each kafka topic contains only 1 partition. (That means that messages can be consumed only by one Consumer per 1 consumer group)
> When backlog of topic grows and system scales from 1 to X machines, all the messages seems to be executed onĀ  the same machine on which they are read. Due to that message throughput doesn't increase comparing X machines to 1 machine. If one machine was reading 2K messagesĀ  per s, X machines will be reading the same amount.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)