You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@zookeeper.apache.org by "Thawan Kooburat (JIRA)" <ji...@apache.org> on 2012/11/02 00:14:12 UTC

[jira] [Created] (ZOOKEEPER-1573) Unable to load database due to missing parent node

Thawan Kooburat created ZOOKEEPER-1573:
------------------------------------------

             Summary: Unable to load database due to missing parent node
                 Key: ZOOKEEPER-1573
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1573
             Project: ZooKeeper
          Issue Type: Bug
          Components: server
    Affects Versions: 3.4.3
            Reporter: Thawan Kooburat


While replaying txnlog on data tree, the server has a code to detect missing parent node. This code block was last modified as part of ZOOKEEPER-1333. In our production, we found a case where this check is return false positive.

The sequence of txns is as follows:

zxid 1:  create /prefix/a
zxid 2:  create /prefix/a/b
zxid 3:  delete /prefix/a/b
zxid 4:  delete /prefix/a

The server start capturing snapshot at zxid 1. However, by the time it traversing the data tree down to /prefix, txn 4 is already applied and /prefix have no children. 

When the server restore from snapshot, it process txnlog starting from zxid 2. This txn generate missing parent error and the server refuse to start up.

The same check allow me to discover bug in ZOOKEEPER-1551, but I don't know if we have any option beside removing this check to solve this issue.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira