You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kylin.apache.org by "hongbin ma (JIRA)" <ji...@apache.org> on 2016/07/20 03:43:20 UTC

[jira] [Commented] (KYLIN-1853) MR build job has exceptions in logs but shows SUCCESS finally

    [ https://issues.apache.org/jira/browse/KYLIN-1853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15385271#comment-15385271 ] 

hongbin ma commented on KYLIN-1853:
-----------------------------------

the issue is somehow connected to https://issues.apache.org/jira/browse/KYLIN-1684, where the root cause was a failure to get counters from history server

> MR build job has exceptions in logs but shows SUCCESS finally
> -------------------------------------------------------------
>
>                 Key: KYLIN-1853
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1853
>             Project: Kylin
>          Issue Type: Bug
>          Components: Job Engine
>    Affects Versions: v1.5.2
>            Reporter: Billy(Yiming) Liu
>            Assignee: Dong Li
>            Priority: Minor
>
> Found a few ConnectionRefused exceptions during MR job build, finally the cube contains no data, but stills shows SUCCESS. It should be FAIL.
> 2016-07-06 07:53:42,626 DEBUG [pool-5-thread-2] hbase.HBaseResourceStore:262 : Update row /execute_output/1208a2e0-8696-4fa6-a3e7-bb7d16ac5be8-01 from oldTs: 1467791612594, to newTs: 1467791622620, operation result: true
> 2016-07-06 07:54:01,216 INFO  [pool-4-thread-1] threadpool.DefaultScheduler:106 : Job Fetcher: 1 running, 1 actual running, 0 ready, 0 others
> 2016-07-06 07:54:13,037 ERROR [pool-5-thread-2] common.HadoopCmdOutput:100 : java.net.ConnectException: Call From BDP-01/192.168.0.184 to 0.0.0.0:10020 failed on connection exception: java.net.ConnectException: Connection refused; For more details see:  http://wiki.apache.org/hadoop/ConnectionRefused
> java.io.IOException: java.net.ConnectException: Call From BDP-01/192.168.0.184 to 0.0.0.0:10020 failed on connection exception: java.net.ConnectException: Connection refused; For more details see:  http://wiki.apache.org/hadoop/ConnectionRefused
>                 at org.apache.hadoop.mapred.ClientServiceDelegate.invoke(ClientServiceDelegate.java:337)
>                 at org.apache.hadoop.mapred.ClientServiceDelegate.getJobCounters(ClientServiceDelegate.java:374)
>                 at org.apache.hadoop.mapred.YARNRunner.getJobCounters(YARNRunner.java:564)
>                 at org.apache.hadoop.mapreduce.Job$7.run(Job.java:767)
>                 at org.apache.hadoop.mapreduce.Job$7.run(Job.java:764)
>                 at java.security.AccessController.doPrivileged(Native Method)
>                 at javax.security.auth.Subject.doAs(Subject.java:415)
>                 at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
>                 at org.apache.hadoop.mapreduce.Job.getCounters(Job.java:764)
>                 at org.apache.kylin.engine.mr.common.HadoopCmdOutput.updateJobCounter(HadoopCmdOutput.java:86)
>                 at org.apache.kylin.engine.mr.common.MapReduceExecutable.readCounters(MapReduceExecutable.java:196)
>                 at org.apache.kylin.engine.mr.common.MapReduceExecutable.doWork(MapReduceExecutable.java:163)
>                 at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:114)
>                 at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
>                 at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:114)
>                 at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:124)
>                 at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>                 at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>                 at java.lang.Thread.run(Thread.java:745)
> Caused by: java.net.ConnectException: Call From BDP-01/192.168.0.184 to 0.0.0.0:10020 failed on connection exception: java.net.ConnectException: Connection refused; For more details see:  http://wiki.apache.org/hadoop/ConnectionRefused
>                 at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>                 at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
>                 at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>                 at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
>                 at org.apache.hadoop.net.NetUtils.wrapWithMessage(NetUtils.java:791)
>                 at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:731)
>                 at org.apache.hadoop.ipc.Client.call(Client.java:1472)
>                 at org.apache.hadoop.ipc.Client.call(Client.java:1399)
>                 at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:232)
>                 at com.sun.proxy.$Proxy49.getCounters(Unknown Source)
>                 at org.apache.hadoop.mapreduce.v2.api.impl.pb.client.MRClientProtocolPBClientImpl.getCounters(MRClientProtocolPBClientImpl.java:166)
>                 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>                 at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>                 at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>                 at java.lang.reflect.Method.invoke(Method.java:606)
>                 at org.apache.hadoop.mapred.ClientServiceDelegate.invoke(ClientServiceDelegate.java:323)
>                 ... 18 more
> Caused by: java.net.ConnectException: Connection refused
>                 at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
>                 at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
>                 at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
>                 at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:530)
>                 at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:494)
>                 at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:607)
>                 at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:705)
>                 at org.apache.hadoop.ipc.Client$Connection.access$2800(Client.java:368)
>                 at org.apache.hadoop.ipc.Client.getConnection(Client.java:1521)
>                 at org.apache.hadoop.ipc.Client.call(Client.java:1438)
>                 ... 27 more
> 2016-07-06 07:54:13,062 DEBUG [pool-5-thread-2] hbase.HBaseResourceStore:262 : Update row /execute_output/1208a2e0-8696-4fa6-a3e7-bb7d16ac5be8-01 from oldTs: 1467791622620, to newTs: 1467791653055, operation result: true
> 2016-07-06 07:54:13,070 DEBUG [pool-5-thread-2] hbase.HBaseResourceStore:262 : Update row /execute_output/1208a2e0-8696-4fa6-a3e7-bb7d16ac5be8-01 from oldTs: 1467791653055, to newTs: 1467791653065, operation result: true
> 2016-07-06 07:54:13,080 DEBUG [pool-5-thread-2] hbase.HBaseResourceStore:262 : Update row /execute_output/1208a2e0-8696-4fa6-a3e7-bb7d16ac5be8-01 from oldTs: 1467791653065, to newTs: 1467791653075, operation result: true
> 2016-07-06 07:54:13,080 INFO  [pool-5-thread-2] manager.ExecutableManager:274 : job id:1208a2e0-8696-4fa6-a3e7-bb7d16ac5be8-01 from RUNNING to SUCCEED



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)