You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@oozie.apache.org by "Mona Chitnis (JIRA)" <ji...@apache.org> on 2013/09/03 23:55:52 UTC

[jira] [Updated] (OOZIE-1513) Workflow stays in running if Fork/join validation or loop detection fails

     [ https://issues.apache.org/jira/browse/OOZIE-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mona Chitnis updated OOZIE-1513:
--------------------------------

    Attachment: OOZIE-1513.patch

Tested with end-to-end test. Unit test doesnt help replicate the same situation i.e. nodeHandler throwing WorkflowException
                
> Workflow stays in running if Fork/join validation or loop detection fails
> -------------------------------------------------------------------------
>
>                 Key: OOZIE-1513
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1513
>             Project: Oozie
>          Issue Type: Bug
>    Affects Versions: trunk
>            Reporter: Mona Chitnis
>            Assignee: Mona Chitnis
>             Fix For: trunk
>
>         Attachments: OOZIE-1513.patch
>
>   Original Estimate: 6h
>  Remaining Estimate: 6h
>
> If some jobs have configured their workflow definition wrongly with improper use of fork-join combination, in some occurrences the jobs did not go to FAILED. Recovery service keeps picking up and running them again and again, so log is full of errors.
> {code}
> 2012-10-03 19:40:54,035 ERROR SignalXCommand:536 - USER[joe] GROUP[users] TOKEN[-] APP[my-oozie-app] JOB[0001800-120927185459177-oozie-wrkf-W] ACTION[0001800-120927185459177-oozie-wrkf-W@streaming-job] XException, 
> org.apache.oozie.command.CommandException: E0720: Fork/join mismatch, node [join]
> 	at org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:165)
> 	at org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:63)
> 	at org.apache.oozie.command.XCommand.call(XCommand.java:277)
> 	at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
> 	at java.lang.Thread.run(Thread.java:662)
> Caused by: org.apache.oozie.workflow.WorkflowException: E0720: Fork/join mismatch, node [join]
> 	at org.apache.oozie.workflow.lite.JoinNodeDef$JoinNodeHandler.loopDetection(JoinNodeDef.java:47)
> 	at org.apache.oozie.workflow.lite.LiteWorkflowInstance.signal(LiteWorkflowInstance.java:206)
> 	at org.apache.oozie.workflow.lite.LiteWorkflowInstance.signal(LiteWorkflowInstance.java:287)
> 	at org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:162)
> 	... 6 more
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira