Posted to dev@oozie.apache.org by "Andras Salamon (Jira)" <ji...@apache.org> on 2020/02/20 13:03:00 UTC

[jira] [Updated] (OOZIE-3527) Oozie stuck in waiting state if CoordPushDependencyCheckXCommand is not requeued

     [ https://issues.apache.org/jira/browse/OOZIE-3527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andras Salamon updated OOZIE-3527:
----------------------------------
    Fix Version/s:     (was: trunk)
                   5.2.0

> Oozie stuck in waiting state if CoordPushDependencyCheckXCommand is not requeued
> --------------------------------------------------------------------------------
>
>                 Key: OOZIE-3527
>                 URL: https://issues.apache.org/jira/browse/OOZIE-3527
>             Project: Oozie
>          Issue Type: Bug
>            Reporter: Rohini Palaniswamy
>            Assignee: Manasa Gogineni
>            Priority: Major
>             Fix For: 5.2.0
>
>         Attachments: OOZIE-3527-1.patch
>
>
> We had a case where CoordPushDependencyCheckXCommand failed in loadState during a DB service failover to another node. A failure in loadState does not requeue the command. Ideally the RecoveryService should pick the action up, but because CoordActionInputCheckXCommand was also running for the HDFS dependencies and constantly updating lastModifiedTime, the RecoveryService never picked it up either. The action was stuck in WAITING forever because the HCatalog dependencies were never discovered.
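
The starvation described above can be illustrated with a small, self-contained simulation. This is only a sketch under stated assumptions, not Oozie's actual code: the class `RecoveryStarvationSketch`, the `WaitingAction` type, the `eligibleForRecovery` rule, and all intervals are hypothetical. It assumes a recovery pass that considers only actions whose lastModifiedTime is older than some threshold, and a pull-style input check that touches lastModifiedTime each time it runs.

```java
import java.time.Duration;
import java.time.Instant;

public class RecoveryStarvationSketch {

    /** Hypothetical stand-in for a coordinator action in WAITING; not Oozie's bean. */
    static final class WaitingAction {
        Instant lastModifiedTime;

        WaitingAction(Instant lastModifiedTime) {
            this.lastModifiedTime = lastModifiedTime;
        }
    }

    /** Assumed recovery rule: only actions untouched for at least olderThan qualify. */
    static boolean eligibleForRecovery(WaitingAction action, Instant now, Duration olderThan) {
        Instant cutoff = now.minus(olderThan);
        return !action.lastModifiedTime.isAfter(cutoff);
    }

    /**
     * Steps through time one second at a time and counts how often the
     * recovery pass would have picked up the action.
     */
    static int simulate(Duration inputCheckInterval, Duration recoveryInterval,
                        Duration olderThan, Duration total) {
        Instant start = Instant.EPOCH;
        WaitingAction action = new WaitingAction(start);
        int picked = 0;
        for (long s = 1; s <= total.getSeconds(); s++) {
            Instant now = start.plusSeconds(s);
            // The input check runs first and touches the row.
            if (s % inputCheckInterval.getSeconds() == 0) {
                action.lastModifiedTime = now;
            }
            // The recovery pass then looks only at sufficiently old rows.
            if (s % recoveryInterval.getSeconds() == 0
                    && eligibleForRecovery(action, now, olderThan)) {
                picked++;
            }
        }
        return picked;
    }

    public static void main(String[] args) {
        Duration total = Duration.ofSeconds(600);
        Duration recoveryEvery = Duration.ofSeconds(60);
        Duration olderThan = Duration.ofSeconds(120);

        // Input check touches the row every 60s, more often than the threshold.
        int withInputCheck = simulate(Duration.ofSeconds(60), recoveryEvery, olderThan, total);
        // Input check never fires within the simulated window.
        int withoutInputCheck = simulate(Duration.ofSeconds(601), recoveryEvery, olderThan, total);

        System.out.println("picked with input check running: " + withInputCheck);
        System.out.println("picked without input check: " + withoutInputCheck);
    }
}
```

In this model, whenever the input check interval is shorter than the recovery threshold, the action never becomes eligible, so a push dependency check that was dropped is never requeued by the recovery pass.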



--
This message was sent by Atlassian Jira
(v8.3.4#803005)