Posted to user@oozie.apache.org by 任桂禾 <re...@ebupt.com> on 2013/11/21 15:15:03 UTC
JA008 when rerunning a coordinator action
Hi,
I encountered a JA008 error when rerunning a coordinator action.
The actions of the workflow job that have already executed are too expensive to re-execute, so I can't rerun the coordinator action; I rerun it as a workflow instead.
In my job.properties, I set:
oozie.wf.rerun.failnodes=true
or
oozie.wf.rerun.skip.nodes=node1,node2,node3……
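To make the setup concrete, here is a minimal sketch of the two job.properties variants described above. The application path is a placeholder (an assumption, not the real path), and node1,node2 stand in for the actual node names. As far as I understand the Oozie documentation, only one of the two rerun properties should be defined for a given rerun, never both.

```properties
# Placeholder workflow application path (assumption, adjust to the real HDFS location)
oozie.wf.application.path=${nameNode}/user/${user.name}/my-workflow-app

# Variant A: rerun only the nodes that failed, skipping those that succeeded
oozie.wf.rerun.failnodes=true

# Variant B: name the nodes to skip explicitly (comma-separated).
# Leave this commented out while Variant A is active; the two must not be combined.
# oozie.wf.rerun.skip.nodes=node1,node2
```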
But sometimes some actions can't be rerun. The error info looks like this:
JA008: File does not exist: /user/root/oozie-hado/0000139-131111110706796-oozie-hado-W/_1063--shell/shell-launcher.jar
JA008: File does not exist: /user/root/oozie-hado/0000143-131111110706796-oozie-hado-W/_1041--shell/sh
org.apache.oozie.action.ActionExecutorException: JA008: File does not exist: /user/root/oozie-hado/0000143-131111110706796-oozie-hado-W/_1041--shell/shell-launcher.jar
at org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecutor.java:412)
at org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.java:396)
at org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:762)
at org.apache.oozie.action.hadoop.JavaActionExecutor.start(JavaActionExecutor.java:913)
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:211)
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:59)
at org.apache.oozie.command.XCommand.call(XCommand.java:277)
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
I am confused by the rerun behavior: sometimes it works normally and sometimes it doesn't, so it seems unstable.
Sometimes some actions just can't be rerun. Besides JA008, I also get error info like this:
2013-11-20 20:18:55,665 WARN CallableQueueService$CallableWrapper:542 - USER[-] GROUP[-] TOKEN[-] APP[-] JOB[-] ACTION[-] exception callable [callback], E0800: Action it is not running its in [FAILED] state, action [0000143-131111110706796-oozie-hado-W@_1041]
org.apache.oozie.command.CommandException: E0800: Action it is not running its in [FAILED] state, action [0000143-131111110706796-oozie-hado-W@_1041]
at org.apache.oozie.command.wf.CompletedActionXCommand.eagerVerifyPrecondition(CompletedActionXCommand.java:85)
at org.apache.oozie.command.XCommand.call(XCommand.java:248)
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
2013-11-21 21:32:31,816 WARN ActionStartXCommand:542 - USER[root] GROUP[-] TOKEN[] APP[1114-left] JOB[0000139-131111110706796-oozie-hado-W] ACTION[0000139-131111110706796-oozie-hado-W@_1180] E0816: Action pending=[false], status=[FAILED]. Skipping ActionStart Execution, Error Code: E0816
Can somebody help me out?
Thanks!