Posted to issues@hbase.apache.org by "Nick Dimiduk (Jira)" <ji...@apache.org> on 2020/03/19 23:21:00 UTC

[jira] [Resolved] (HBASE-23985) [flakey test] TestZooKeeper

     [ https://issues.apache.org/jira/browse/HBASE-23985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Dimiduk resolved HBASE-23985.
----------------------------------
    Resolution: Duplicate

I think the root cause here is fixed by [~stack]'s work on HBASE-23984.

> [flakey test] TestZooKeeper
> ---------------------------
>
>                 Key: HBASE-23985
>                 URL: https://issues.apache.org/jira/browse/HBASE-23985
>             Project: HBase
>          Issue Type: Test
>          Components: test
>    Affects Versions: 3.0.0
>            Reporter: Nick Dimiduk
>            Priority: Major
>         Attachments: TEST-org.apache.hadoop.hbase.TestZooKeeper.xml
>
>
> I observed a test failure in {{TestZooKeeper#testMasterSessionExpired}} on my local rig. On a casual read of the logs from {{testMasterSessionExpired}}, it appears we have a faulty assumption related to master MTTR; the master abort is logged ~1250ms after ZK session close, which seems entirely too fast to me. Once the master aborts, the damage is done and the test cannot recover.
> The first re-run passes. Surefire does not keep logs of successful tests, so I don't know the timing between events in the successful run.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)