You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@oozie.apache.org by "Purshotam Shah (JIRA)" <ji...@apache.org> on 2014/10/14 19:23:34 UTC

[jira] [Commented] (OOZIE-1896) ZKUUIDService - Too many job submission fails

    [ https://issues.apache.org/jira/browse/OOZIE-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14171239#comment-14171239 ] 

Purshotam Shah commented on OOZIE-1896:
---------------------------------------

Response form Curator team
{quote}
PromotedToLock with a reasonable retry should make failures almost never happen. You can also just set the number of retries to a huge number.
{quote}


Patch does....
1. Change the retry logic from ExponentialBackoffRetry to RetryNTimes.
2. If ZKUUIDService can't get ID, it will fall back to parent UUIDService.
3. Fix flaky testResetSequence_withMultiThread



> ZKUUIDService - Too many job submission fails
> ---------------------------------------------
>
>                 Key: OOZIE-1896
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1896
>             Project: Oozie
>          Issue Type: Bug
>          Components: HA
>            Reporter: Purshotam Shah
>            Assignee: Purshotam Shah
>             Fix For: trunk, 4.1.0
>
>
> DistributedAtomicLong.prevalue and postValue  =0 and succeeded = false.
> ZK error.
> Thu Jun 12 22:22:00 2014: 2014-06-12 22:22:00,981 - INFO  [ProcessThread(sid:0 cport:-1)::PrepRequestProcessor@627] - Got user-level KeeperException when processing sessionid:0x145bf4221730116 type:setData cxid:0x2f992 zxid:0x30b14d txntype:-1 reqpath:n/a Error Path:/oozie_test1/job_id_sequence Error:KeeperErrorCode = BadVersion for /oozie_test1/job_id_sequence



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)