Posted to users@kafka.apache.org by omer bar lev <om...@gmail.com> on 2020/10/01 05:56:56 UTC

Unexpected cluster rebalancing

As part of our load tests on Kafka, we are trying to create a certain
number of sink connectors, each with a unique name and topic, so each
sink connector runs in its own consumer group.
For every new connector added, a rebalance is triggered for all the
connectors (the more connectors that were created before, the longer the
current rebalance takes).
We create the connectors by sending parallel create requests to the
connector REST API. We have tried this in several ways: all 1000 requests
(the desired total) at once, and also in smaller batches of 10 or 20
requests. A sketch of the request loop follows the connector template below.

We have a few questions:

Why is the rebalance happening?
Why does it take so long (it can take hours)?
Is the rebalance configurable?

The cluster is deployed using Strimzi on Kubernetes.

The connectors are created with the following template:

POST /connectors
{
  "name": "HttpSinkConnector-{index}",
  "config": {
    "connector.class": "com.connectors.rest.sink.HttpSinkConnector",
    "tasks.max": "1",
    "topics": "http-sink-connector-{index}",
    "http.api.url": "{ some url }",
    "headers": "Content-Type:application/json"
  }
}
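
For reference, a minimal sketch of our creation loop (illustrative only:
the Connect REST URL, sink URL, and batch size below are placeholders, and
we use Python with the requests library):

import concurrent.futures
import requests

CONNECT_URL = "http://connect-cluster:8083"  # placeholder for our worker's REST endpoint
TOTAL = 1000        # desired number of connectors
BATCH_SIZE = 20     # we also tried 10, and firing all TOTAL at once

def create_connector(index):
    # Same body as the template above; the sink URL is a placeholder here too.
    body = {
        "name": f"HttpSinkConnector-{index}",
        "config": {
            "connector.class": "com.connectors.rest.sink.HttpSinkConnector",
            "tasks.max": "1",
            "topics": f"http-sink-connector-{index}",
            "http.api.url": "http://example.invalid/sink",  # placeholder
            "headers": "Content-Type:application/json",
        },
    }
    response = requests.post(CONNECT_URL + "/connectors", json=body)
    response.raise_for_status()

# Fire the create requests in parallel, BATCH_SIZE at a time.
with concurrent.futures.ThreadPoolExecutor(max_workers=BATCH_SIZE) as pool:
    for start in range(0, TOTAL, BATCH_SIZE):
        batch = range(start, min(start + BATCH_SIZE, TOTAL))
        list(pool.map(create_connector, batch))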

Connect worker configuration:

    offset.storage.topic=connect-cluster-offsets
    value.converter=org.apache.kafka.connect.json.JsonConverter
    config.storage.topic=connect-cluster-configs
    key.converter=org.apache.kafka.connect.json.JsonConverter
    group.id=connect-cluster
    status.storage.topic=connect-cluster-status
    config.storage.replication.factor=3
    connector.client.config.override.policy=All
    consumer.override.auto.offset.reset=latest
    heartbeat.interval.ms=100000
    key.converter.schemas.enable=false
    offset.storage.replication.factor=3
    session.timeout.ms=300000
    status.storage.replication.factor=3
    value.converter.schemas.enable=false
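
For anyone reproducing this: the connector and task states during the
rebalance can be polled through the standard Connect status endpoint. A
minimal sketch (the host name is again a placeholder):

import requests

CONNECT_URL = "http://connect-cluster:8083"  # placeholder

def connector_states():
    # GET /connectors lists names; GET /connectors/{name}/status reports
    # the state of each connector and its tasks.
    states = {}
    for name in requests.get(CONNECT_URL + "/connectors").json():
        status = requests.get(f"{CONNECT_URL}/connectors/{name}/status").json()
        states[name] = (
            status["connector"]["state"],               # e.g. RUNNING or UNASSIGNED
            [task["state"] for task in status["tasks"]],
        )
    return states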