You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@qpid.apache.org by "ASF subversion and git services (JIRA)" <ji...@apache.org> on 2015/03/25 11:57:52 UTC

[jira] [Commented] (QPID-6464) HA hangs for duration of replica conistency policy timeout if the node changes role whilst recovery is in flight

    [ https://issues.apache.org/jira/browse/QPID-6464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14379691#comment-14379691 ] 

ASF subversion and git services commented on QPID-6464:
-------------------------------------------------------

Commit 1669092 from orudyy@apache.org in branch 'qpid/trunk'
[ https://svn.apache.org/r1669092 ]

QPID-6464: Set replica consistency policy to 'NoConsistencyPolicy' in order to avoid hanging for timeout specified in TimeConsistencyPolicy on creation of JE transaction after transition from Master into Detached state when HA claster has no majority but the remaining Master change configuration tasks attempted to execute

> HA hangs for duration of replica conistency policy timeout if the node changes role whilst recovery is in flight
> ----------------------------------------------------------------------------------------------------------------
>
>                 Key: QPID-6464
>                 URL: https://issues.apache.org/jira/browse/QPID-6464
>             Project: Qpid
>          Issue Type: Bug
>          Components: Java Broker
>    Affects Versions: 6.0 [Java]
>            Reporter: Alex Rudyy
>            Assignee: Alex Rudyy
>         Attachments: apparently-hung-stack.txt
>
>
> On node transition from Master state some in-flight pending configuration change tasks might be attempted to execute after Master transited into Detached/Replica state when node has not been synchronized with new Master or new Master has not been elected yet. On trying to execute configuration change tasks  BDB HA Store can try to create a JE transaction. The creation of JE transaction can hang for a duration of timeout interval which is configured in TimeConsistencyPolicy. The following database write operation would cause ReplicaWriteException and transaction abort. 
> The store should fail immediately in this case rather then pointlessly waiting for some time before failing anyway. More over, a pointless waiting blocks configuration thread making it impossible to perform any other configuration tasks
> Here is the stack trace for a problem
> {noformat}
> at com.sleepycat.je.rep.txn.ReadonlyTxn.<init>(ReadonlyTxn.java:39)
> at com.sleepycat.je.rep.impl.RepImpl.createRepUserTxn(RepImpl.java:924)
> at com.sleepycat.je.txn.Txn.createUserTxn(Txn.java:301)
> at com.sleepycat.je.txn.TxnManager.txnBegin(TxnManager.java:182)
> at com.sleepycat.je.dbi.EnvironmentImpl.txnBegin(EnvironmentImpl.java:2366)
> at com.sleepycat.je.Environment.beginTransactionInternal(Environment.java:1437)
> at com.sleepycat.je.Environment.beginTransaction(Environment.java:1319)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@qpid.apache.org
For additional commands, e-mail: dev-help@qpid.apache.org