You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@oozie.apache.org by 任桂禾 <re...@ebupt.com> on 2013/11/21 15:15:03 UTC

JA008 in reruning coordinator action

hi,
 I  encountered JA008 error when reruning a  coordinator action.
 The  already executed action of the workflow job are too expensive to be  re-executed,so I can't rerun the coordinator action ,I rerun it as a workflow  instead .
 In my job. properties, I  set:
 oozie.wf.rerun.failnodes=true  
 or  
 oozie.wf.rerun.skip.nodes=node1,node2,node3……
 But  sometimes some acitons can't rerun,the error info is like this:
 JA008:  File does not exist:  /user/root/oozie-hado/0000139-131111110706796-oozie-hado-W/_1063--shell/shell-launcher.jar
 JA008: File does not exist: /user/root/oozie-hado/0000143-131111110706796-oozie-hado-W/_1041--shell/sh  
  org.apache.oozie.action.ActionExecutorException:  JA008: File does not exist: /user/root/oozie-hado/0000143-131111110706796
 -oozie-hado-W/_1041--shell/shell-launcher.jar
          at  org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecutor.java:412)
          at  org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.java:396)
          at  org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:762)
          at  org.apache.oozie.action.hadoop.JavaActionExecutor.start(JavaActionExecutor.java:913)
          at  org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:211)
          at  org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:59)
          at org.apache.oozie.command.XCommand.call(XCommand.java:277)
          at  org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
          at  java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
          at  java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
          at java.lang.Thread.run(Thread.java:662)
  
 I  am so confused while I was trying to rerun,because sometimes it worked  normally,sometimes it did't,it seemed instable.
 Sometimes  some actions just can't rerun,besides the JA008 ,I also get error info like  this:
   2013-11-20 20:18:55,665  WARN  CallableQueueService$CallableWrapper:542 - USER[-] GROUP[-] TOKEN[-] APP[-]  JOB[-] ACTION[-]
  exception callable [callback], E0800: Action it is  not running its in [FAILED] state, action [0000143-131111110706796-ooz
 ie-hado-W@_1041]
 org.apache.oozie.command.CommandException: E0800:  Action it is not running its in [FAILED] state, action [0000143-13111111
 0706796-oozie-hado-W@_1041]
         at  org.apache.oozie.command.wf.CompletedActionXCommand.eagerVerifyPrecondition(CompletedActionXCommand.java:85)
         at  org.apache.oozie.command.XCommand.call(XCommand.java:248)
         at  org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
         at  java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
         at  java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
         at  java.lang.Thread.run(Thread.java:662)


 2013-11-21  21:32:31,816  WARN ActionStartXCommand:542 - USER[root] GROUP[-] TOKEN[]  APP[1114-left] JOB[0000139-131111110706796-oozie-hado-W]  ACTION[0000139-131111110706796-oozie-hado-W@_1180] E0816: Action  pending=[false], status=[FAILED]. Skipping ActionStart Execution, Error Code:  E0816
  
 Can  somebody help me out ?
 thx!