Posted to dev@oozie.apache.org by "Mona Chitnis (JIRA)" <ji...@apache.org> on 2013/09/03 23:55:52 UTC
[jira] [Updated] (OOZIE-1513) Workflow stays in running if Fork/join validation or loop detection fails
[ https://issues.apache.org/jira/browse/OOZIE-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mona Chitnis updated OOZIE-1513:
--------------------------------
Attachment: OOZIE-1513.patch
Tested with an end-to-end test. A unit test doesn't help replicate the same situation, i.e. nodeHandler throwing WorkflowException.
> Workflow stays in running if Fork/join validation or loop detection fails
> -------------------------------------------------------------------------
>
> Key: OOZIE-1513
> URL: https://issues.apache.org/jira/browse/OOZIE-1513
> Project: Oozie
> Issue Type: Bug
> Affects Versions: trunk
> Reporter: Mona Chitnis
> Assignee: Mona Chitnis
> Fix For: trunk
>
> Attachments: OOZIE-1513.patch
>
> Original Estimate: 6h
> Remaining Estimate: 6h
>
> If a job's workflow definition uses an improper fork/join combination, in some cases the job does not go to FAILED. The recovery service keeps picking it up and running it again and again, so the log fills with errors like the following:
> {code}
> 2012-10-03 19:40:54,035 ERROR SignalXCommand:536 - USER[joe] GROUP[users] TOKEN[-] APP[my-oozie-app] JOB[0001800-120927185459177-oozie-wrkf-W] ACTION[0001800-120927185459177-oozie-wrkf-W@streaming-job] XException,
> org.apache.oozie.command.CommandException: E0720: Fork/join mismatch, node [join]
> at org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:165)
> at org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:63)
> at org.apache.oozie.command.XCommand.call(XCommand.java:277)
> at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
> at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
> at java.lang.Thread.run(Thread.java:662)
> Caused by: org.apache.oozie.workflow.WorkflowException: E0720: Fork/join mismatch, node [join]
> at org.apache.oozie.workflow.lite.JoinNodeDef$JoinNodeHandler.loopDetection(JoinNodeDef.java:47)
> at org.apache.oozie.workflow.lite.LiteWorkflowInstance.signal(LiteWorkflowInstance.java:206)
> at org.apache.oozie.workflow.lite.LiteWorkflowInstance.signal(LiteWorkflowInstance.java:287)
> at org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:162)
> ... 6 more
> {code}
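
The behaviour described above can be illustrated with a minimal, self-contained sketch. It is not the attached patch and does not use Oozie's classes: `Job`, `Status`, `validateForkJoin`, `signal`, and `recover` are hypothetical stand-ins, and the validation is a deliberately simplified fork/join counter rather than Oozie's real check. It only shows the idea that a validation failure should move the job to FAILED, so that a recovery pass no longer finds it in RUNNING.

```java
import java.util.Arrays;
import java.util.List;

public class ForkJoinFailureSketch {

    enum Status { RUNNING, FAILED, SUCCEEDED }

    /** Stand-in for org.apache.oozie.workflow.WorkflowException (hypothetical, simplified). */
    static class WorkflowException extends Exception {
        WorkflowException(String message) { super(message); }
    }

    /** Hypothetical job record; the node list is a simplified linear view of the workflow. */
    static class Job {
        Status status = Status.RUNNING;
        final List<String> nodes;
        Job(List<String> nodes) { this.nodes = nodes; }
    }

    /** Simplified check (an assumption): every join must close a previously opened fork. */
    static void validateForkJoin(List<String> nodes) throws WorkflowException {
        int openForks = 0;
        for (String node : nodes) {
            if (node.equals("fork")) {
                openForks++;
            } else if (node.equals("join")) {
                if (openForks == 0) {
                    throw new WorkflowException("E0720: Fork/join mismatch, node [join]");
                }
                openForks--;
            }
        }
        if (openForks != 0) {
            throw new WorkflowException("Fork/join mismatch: fork without join");
        }
    }

    /** On a validation failure, mark the job FAILED instead of leaving it RUNNING. */
    static void signal(Job job) {
        try {
            validateForkJoin(job.nodes);
            job.status = Status.SUCCEEDED;
        } catch (WorkflowException e) {
            job.status = Status.FAILED;
        }
    }

    /** Hypothetical recovery pass: re-signals every job still in RUNNING and counts them. */
    static int recover(List<Job> jobs) {
        int requeued = 0;
        for (Job job : jobs) {
            if (job.status == Status.RUNNING) {
                signal(job);
                requeued++;
            }
        }
        return requeued;
    }

    public static void main(String[] args) {
        Job bad = new Job(Arrays.asList("start", "join", "end"));
        signal(bad);
        System.out.println("status after signal: " + bad.status);
        System.out.println("jobs re-queued by recovery: " + recover(Arrays.asList(bad)));
    }
}
```

Because the failed job leaves RUNNING in this sketch, the recovery pass has nothing to re-queue, which is the outcome the issue asks for.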
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira