You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Grant Henke (JIRA)" <ji...@apache.org> on 2016/09/13 14:29:21 UTC

[jira] [Updated] (KAFKA-4157) Transient system test failure in replica_verification_test.test_replica_lags

     [ https://issues.apache.org/jira/browse/KAFKA-4157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Henke updated KAFKA-4157:
-------------------------------
    Status: Patch Available  (was: Open)

> Transient system test failure in replica_verification_test.test_replica_lags
> ----------------------------------------------------------------------------
>
>                 Key: KAFKA-4157
>                 URL: https://issues.apache.org/jira/browse/KAFKA-4157
>             Project: Kafka
>          Issue Type: Bug
>          Components: system tests
>    Affects Versions: 0.10.0.0
>            Reporter: Grant Henke
>            Assignee: Grant Henke
>
> The replica_verification_test.test_replica_lags test runs a background thread via replica_verification_tool that populates a dict with max lag for each "topic,partition" key. Because populating that map is in a separate thread, there is a race condition on populating the key and querying it via replica_verification_tool.get_lag_for_partition. This results in a key error like below: 
> {noformat}
> Traceback (most recent call last):
>   File "/usr/lib/python2.7/site-packages/ducktape/tests/runner.py", line 106, in run_all_tests
>     data = self.run_single_test()
>   File "/usr/lib/python2.7/site-packages/ducktape/tests/runner.py", line 162, in run_single_test
>     return self.current_test_context.function(self.current_test)
>   File "/root/kafka/tests/kafkatest/tests/tools/replica_verification_test.py", line 82, in test_replica_lags
>     err_msg="Timed out waiting to reach zero replica lags.")
>   File "/usr/lib/python2.7/site-packages/ducktape/utils/util.py", line 31, in wait_until
>     if condition():
>   File "/root/kafka/tests/kafkatest/tests/tools/replica_verification_test.py", line 81, in <lambda>
>     wait_until(lambda: self.replica_verifier.get_lag_for_partition(TOPIC, 0) == 0, timeout_sec=10,
>   File "/root/kafka/tests/kafkatest/services/replica_verification_tool.py", line 66, in get_lag_for_partition
>     lag = self.partition_lag[topic_partition]
> KeyError: 'topic-replica-verification,0'
> {noformat}
> Instead of an error, None should be returned when no key is found. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)