You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Nigel Daley (JIRA)" <ji...@apache.org> on 2007/03/19 19:48:32 UTC

[jira] Updated: (HADOOP-1132) Job submission RPC more likely to timeout and fail

     [ https://issues.apache.org/jira/browse/HADOOP-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nigel Daley updated HADOOP-1132:
--------------------------------

        Fix Version/s:     (was: 0.12.1)
    Affects Version/s: 0.12.1

> Job submission RPC more likely to timeout and fail
> --------------------------------------------------
>
>                 Key: HADOOP-1132
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1132
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.12.1
>            Reporter: Nigel Daley
>
> For the past week, Small Jobs Benchmark has been failing sporadically when run on small, medium and large size clusters.  I run it with 30 iterations.  It usually fails somewhere between the 12th and 20th iteration with a timeout trying to submit the job.  Perhaps the JT is busier now doing counter related work which causes this to timeout.  Here's the client side exception:
> 07/03/19 13:57:41 INFO mapred.MRBench: Running job 13: input=/MRBench/mr_output/output_-1811371677
> 07/03/19 13:57:41 INFO mapred.InputFormatBase: Total input paths to process : 1
> java.net.SocketTimeoutException: timed out waiting for rpc response
>         at org.apache.hadoop.ipc.Client.call(Client.java:473)
>         at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:163)
>         at $Proxy1.submitJob(Unknown Source)
>         at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:376)
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:543)
>         at org.apache.hadoop.mapred.MRBench.runJobInSequence(MRBench.java:188)
>         at org.apache.hadoop.mapred.MRBench.main(MRBench.java:280)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:71)
>         at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:143)
>         at org.apache.hadoop.test.AllTestDriver.main(AllTestDriver.java:64)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.apache.hadoop.util.RunJar.main(RunJar.java:155)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.