You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@kafka.apache.org by "Greg Harris (Jira)" <ji...@apache.org> on 2020/07/20 05:54:00 UTC

[jira] [Created] (KAFKA-10295) ConnectDistributedTest.test_bounce should wait for graceful stop

Greg Harris created KAFKA-10295:
-----------------------------------

             Summary: ConnectDistributedTest.test_bounce should wait for graceful stop
                 Key: KAFKA-10295
                 URL: https://issues.apache.org/jira/browse/KAFKA-10295
             Project: Kafka
          Issue Type: Test
          Components: KafkaConnect
    Affects Versions: 2.4.1, 2.5.0, 2.3.1, 2.6.0
            Reporter: Greg Harris
            Assignee: Greg Harris


In ConnectDistributedTest.test_bounce, there are flakey failures that appear to follow this pattern:
 # The test is parameterized for hard bounces, and with Incremental Cooperative Rebalancing enabled (does not appear for protocol=eager)
 # A source task is on a worker that will experience a hard bounce
 # The source task has written records which it has not yet committed in source offsets
 # The worker is hard-bounced, and the source task is lost
 # Incremental Cooperative Rebalance starts it's scheduled.rebalance.max.delay.ms delay before recovering the task
 # The test ends, connectors and Connect are stopped
 # The test verifies that the sink connector has only written records that have been committed by the source connector
 # This verification fails because the source offsets are stale, and there are un-committed records in the topic, and the sink connector has written at least one of them.

This can be addressed by ensuring that the test waits for the rebalance delay to expire, and for the lost task to recover and commit offsets past the progress it made before the bounce.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)