You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@oozie.apache.org by Giridhar Addepalli <gi...@gmail.com> on 2013/11/19 08:26:24 UTC

java.io.EOFException while submitting job

Hi All,

We are using 3.3.0 version of Oozie.
Trying to run mapr-reduce app from example/apps that gets shipped with
oozie.

We are getting following error::

2013-11-18 23:20:41,156  INFO ActionStartXCommand:539 - USER[gaddepa]
GROUP[-] TOKEN[] APP[map-reduce-wf]
JOB[0000985-131031060610521-oozie-root-W]
ACTION[0000985-131031060610521-oozie-root-W@:start:] Start action
[0000985-131031060610521-oozie-root-W@:start:] with user-retry state :
userRetryCount [0], userRetryMax [0], userRetryInterval [10]
2013-11-18 23:20:41,157  WARN ActionStartXCommand:542 - USER[gaddepa]
GROUP[-] TOKEN[] APP[map-reduce-wf]
JOB[0000985-131031060610521-oozie-root-W]
ACTION[0000985-131031060610521-oozie-root-W@:start:]
[***0000985-131031060610521-oozie-root-W@:start:***]Action status=DONE
2013-11-18 23:20:41,157  WARN ActionStartXCommand:542 - USER[gaddepa]
GROUP[-] TOKEN[] APP[map-reduce-wf]
JOB[0000985-131031060610521-oozie-root-W]
ACTION[0000985-131031060610521-oozie-root-W@:start:]
[***0000985-131031060610521-oozie-root-W@:start:***]Action updated in DB!
2013-11-18 23:20:41,479  INFO ActionStartXCommand:539 - USER[gaddepa]
GROUP[-] TOKEN[] APP[map-reduce-wf]
JOB[0000985-131031060610521-oozie-root-W]
ACTION[0000985-131031060610521-oozie-root-W@mr-node] Start action
[0000985-131031060610521-oozie-root-W@mr-node] with user-retry state :
userRetryCount [0], userRetryMax [0], userRetryInterval [10]
2013-11-18 23:20:42,497  WARN MapReduceActionExecutor:542 - USER[gaddepa]
GROUP[-] TOKEN[] APP[map-reduce-wf]
JOB[0000985-131031060610521-oozie-root-W]
ACTION[0000985-131031060610521-oozie-root-W@mr-node] credentials is null
for the action
2013-11-18 23:20:43,974  WARN ActionStartXCommand:542 - USER[gaddepa]
GROUP[-] TOKEN[] APP[map-reduce-wf]
JOB[0000985-131031060610521-oozie-root-W]
ACTION[0000985-131031060610521-oozie-root-W@mr-node] Error starting action
[mr-node]. ErrorType [TRANSIENT], ErrorCode [JA009], Message [JA009: null]
org.apache.oozie.action.ActionExecutorException: JA009: null
        at
org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecutor.java:418)
        at
org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.java:396)
        at
org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:739)
        at
org.apache.oozie.action.hadoop.JavaActionExecutor.start(JavaActionExecutor.java:889)
        at
org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:211)
        at
org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:59)
        at org.apache.oozie.command.XCommand.call(XCommand.java:277)
        at
org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:326)
        at
org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:255)
        at
org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
        at java.lang.Thread.run(Thread.java:662)
Caused by: java.io.EOFException
        at java.io.DataInputStream.readInt(DataInputStream.java:375)
        at org.apache.hadoop.mapreduce.ID.readFields(ID.java:83)
        at org.apache.hadoop.mapreduce.JobID.readFields(JobID.java:122)
        at
org.apache.oozie.service.MaprJobClient.submitJob(MaprJobClient.java:292)
        at
org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActionExecutor.java:724)
        ... 10 more
2013-11-18 23:20:43,976  INFO ActionStartXCommand:539 - USER[gaddepa]
GROUP[-] TOKEN[] APP[map-reduce-wf]
JOB[0000985-131031060610521-oozie-root-W]
ACTION[0000985-131031060610521-oozie-root-W@mr-node] Next Retry, Attempt
Number [1] in [60,000] milliseconds


Our oozie server was running fine and we were able to submit jobs and they
were running . All of  a sudden from 2 days we are getting this error.
Any known reason behind this error?

Thanks,
Giridhar.

Re: java.io.EOFException while submitting job

Posted by Mona Chitnis <ch...@yahoo-inc.com>.
I have seen this cryptic error thrown in the context of having wrong
JobTracker/ResourceManager port in the job config. Please check that one.

On 11/18/13 11:26 PM, "Giridhar Addepalli" <gi...@gmail.com> wrote:

>Hi All,
>
>We are using 3.3.0 version of Oozie.
>Trying to run mapr-reduce app from example/apps that gets shipped with
>oozie.
>
>We are getting following error::
>
>2013-11-18 23:20:41,156  INFO ActionStartXCommand:539 - USER[gaddepa]
>GROUP[-] TOKEN[] APP[map-reduce-wf]
>JOB[0000985-131031060610521-oozie-root-W]
>ACTION[0000985-131031060610521-oozie-root-W@:start:] Start action
>[0000985-131031060610521-oozie-root-W@:start:] with user-retry state :
>userRetryCount [0], userRetryMax [0], userRetryInterval [10]
>2013-11-18 23:20:41,157  WARN ActionStartXCommand:542 - USER[gaddepa]
>GROUP[-] TOKEN[] APP[map-reduce-wf]
>JOB[0000985-131031060610521-oozie-root-W]
>ACTION[0000985-131031060610521-oozie-root-W@:start:]
>[***0000985-131031060610521-oozie-root-W@:start:***]Action status=DONE
>2013-11-18 23:20:41,157  WARN ActionStartXCommand:542 - USER[gaddepa]
>GROUP[-] TOKEN[] APP[map-reduce-wf]
>JOB[0000985-131031060610521-oozie-root-W]
>ACTION[0000985-131031060610521-oozie-root-W@:start:]
>[***0000985-131031060610521-oozie-root-W@:start:***]Action updated in DB!
>2013-11-18 23:20:41,479  INFO ActionStartXCommand:539 - USER[gaddepa]
>GROUP[-] TOKEN[] APP[map-reduce-wf]
>JOB[0000985-131031060610521-oozie-root-W]
>ACTION[0000985-131031060610521-oozie-root-W@mr-node] Start action
>[0000985-131031060610521-oozie-root-W@mr-node] with user-retry state :
>userRetryCount [0], userRetryMax [0], userRetryInterval [10]
>2013-11-18 23:20:42,497  WARN MapReduceActionExecutor:542 - USER[gaddepa]
>GROUP[-] TOKEN[] APP[map-reduce-wf]
>JOB[0000985-131031060610521-oozie-root-W]
>ACTION[0000985-131031060610521-oozie-root-W@mr-node] credentials is null
>for the action
>2013-11-18 23:20:43,974  WARN ActionStartXCommand:542 - USER[gaddepa]
>GROUP[-] TOKEN[] APP[map-reduce-wf]
>JOB[0000985-131031060610521-oozie-root-W]
>ACTION[0000985-131031060610521-oozie-root-W@mr-node] Error starting action
>[mr-node]. ErrorType [TRANSIENT], ErrorCode [JA009], Message [JA009: null]
>org.apache.oozie.action.ActionExecutorException: JA009: null
>        at
>org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecut
>or.java:418)
>        at
>org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.jav
>a:396)
>        at
>org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActio
>nExecutor.java:739)
>        at
>org.apache.oozie.action.hadoop.JavaActionExecutor.start(JavaActionExecutor
>.java:889)
>        at
>org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXComman
>d.java:211)
>        at
>org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXComman
>d.java:59)
>        at org.apache.oozie.command.XCommand.call(XCommand.java:277)
>        at
>org.apache.oozie.service.CallableQueueService$CompositeCallable.call(Calla
>bleQueueService.java:326)
>        at
>org.apache.oozie.service.CallableQueueService$CompositeCallable.call(Calla
>bleQueueService.java:255)
>        at
>org.apache.oozie.service.CallableQueueService$CallableWrapper.run(Callable
>QueueService.java:175)
>        at
>java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.
>java:886)
>        at
>java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java
>:908)
>        at java.lang.Thread.run(Thread.java:662)
>Caused by: java.io.EOFException
>        at java.io.DataInputStream.readInt(DataInputStream.java:375)
>        at org.apache.hadoop.mapreduce.ID.readFields(ID.java:83)
>        at org.apache.hadoop.mapreduce.JobID.readFields(JobID.java:122)
>        at
>org.apache.oozie.service.MaprJobClient.submitJob(MaprJobClient.java:292)
>        at
>org.apache.oozie.action.hadoop.JavaActionExecutor.submitLauncher(JavaActio
>nExecutor.java:724)
>        ... 10 more
>2013-11-18 23:20:43,976  INFO ActionStartXCommand:539 - USER[gaddepa]
>GROUP[-] TOKEN[] APP[map-reduce-wf]
>JOB[0000985-131031060610521-oozie-root-W]
>ACTION[0000985-131031060610521-oozie-root-W@mr-node] Next Retry, Attempt
>Number [1] in [60,000] milliseconds
>
>
>Our oozie server was running fine and we were able to submit jobs and they
>were running . All of  a sudden from 2 days we are getting this error.
>Any known reason behind this error?
>
>Thanks,
>Giridhar.