Posted to dev@kafka.apache.org by "Flavio Junqueira (JIRA)" <ji...@apache.org> on 2016/10/05 09:03:21 UTC

[jira] [Comment Edited] (KAFKA-3985) Transient system test failure ZooKeeperSecurityUpgradeTest.test_zk_security_upgrade.security_protocol

    [ https://issues.apache.org/jira/browse/KAFKA-3985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15548125#comment-15548125 ] 

Flavio Junqueira edited comment on KAFKA-3985 at 10/5/16 9:02 AM:
------------------------------------------------------------------

[~rsivaram]

bq. I wouldn't have expected to see a CA with start time 09:55:15 at all. Is it possible that there was another system test running on that same host (the tests use a fixed file name for CA and truststore, so the tests would fail if there are multiple instances of system tests or any command that loads the security config class that is run on that host).

That's a very good point. I think the problem is that the CA files are written to fixed paths in {{/tmp}}:

{noformat}
        self.ca_crt_path = "/tmp/test.ca.crt"
        self.ca_jks_path = "/tmp/test.ca.jks"
{noformat}

and perhaps we should use {{mkdtemp}}, as {{generate_and_copy_keystore}} does. Does that make sense?
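
To make the suggestion concrete, here is a minimal sketch of per-run CA paths built on {{tempfile.mkdtemp}}. The class name {{CaPaths}}, the {{kafka-ca-}} prefix, and the {{cleanup}} method are hypothetical and only for illustration; the actual security config class would need the same change applied wherever it sets {{ca_crt_path}} and {{ca_jks_path}}.

```python
import os
import shutil
import tempfile


class CaPaths(object):
    """Hypothetical holder for CA file locations; not the real kafkatest class."""

    def __init__(self):
        # mkdtemp creates a new, uniquely named directory on every call,
        # so two test runs on the same host no longer share CA files.
        self.ca_dir = tempfile.mkdtemp(prefix="kafka-ca-")  # prefix is an assumption
        self.ca_crt_path = os.path.join(self.ca_dir, "test.ca.crt")
        self.ca_jks_path = os.path.join(self.ca_dir, "test.ca.jks")

    def cleanup(self):
        # Remove the directory and anything generated inside it.
        shutil.rmtree(self.ca_dir, ignore_errors=True)
```

With this, each instance gets its own directory, so a second system test (or any command that loads the config class) on the same host would generate its own CA rather than overwriting one that is in use.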





> Transient system test failure ZooKeeperSecurityUpgradeTest.test_zk_security_upgrade.security_protocol
> -----------------------------------------------------------------------------------------------------
>
>                 Key: KAFKA-3985
>                 URL: https://issues.apache.org/jira/browse/KAFKA-3985
>             Project: Kafka
>          Issue Type: Test
>          Components: system tests
>    Affects Versions: 0.10.0.0
>            Reporter: Jason Gustafson
>
> Found this in the nightly build on the 0.10.0 branch. Full details here: http://testing.confluent.io/confluent-kafka-0-10-0-system-test-results/?prefix=2016-07-22--001.1469199875--apache--0.10.0--71a598a/.  
> {code}
> test_id:    2016-07-22--001.kafkatest.tests.core.zookeeper_security_upgrade_test.ZooKeeperSecurityUpgradeTest.test_zk_security_upgrade.security_protocol=SSL
> status:     FAIL
> run time:   5 minutes 14.067 seconds
>     292 acked message did not make it to the Consumer. They are: 11264, 11265, 11266, 11267, 11268, 11269, 11270, 11271, 11272, 11273, 11274, 11275, 11276, 11277, 11278, 11279, 11280, 11281, 11282, 11283, ...plus 252 more. Total Acked: 11343, Total Consumed: 11054. We validated that the first 272 of these missing messages correctly made it into Kafka's data files. This suggests they were lost on their way to the consumer.
> Traceback (most recent call last):
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/venv/local/lib/python2.7/site-packages/ducktape/tests/runner.py", line 106, in run_all_tests
>     data = self.run_single_test()
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/venv/local/lib/python2.7/site-packages/ducktape/tests/runner.py", line 162, in run_single_test
>     return self.current_test_context.function(self.current_test)
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/venv/local/lib/python2.7/site-packages/ducktape/mark/_mark.py", line 331, in wrapper
>     return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/tests/kafkatest/tests/core/zookeeper_security_upgrade_test.py", line 115, in test_zk_security_upgrade
>     self.run_produce_consume_validate(self.run_zk_migration)
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/tests/kafkatest/tests/produce_consume_validate.py", line 79, in run_produce_consume_validate
>     raise e
> AssertionError: 292 acked message did not make it to the Consumer. They are: 11264, 11265, 11266, 11267, 11268, 11269, 11270, 11271, 11272, 11273, 11274, 11275, 11276, 11277, 11278, 11279, 11280, 11281, 11282, 11283, ...plus 252 more. Total Acked: 11343, Total Consumed: 11054. We validated that the first 272 of these missing messages correctly made it into Kafka's data files. This suggests they were lost on their way to the consumer.
> {code}


