You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kylin.apache.org by "Shaofeng SHI (JIRA)" <ji...@apache.org> on 2015/11/24 07:14:10 UTC

[jira] [Commented] (KYLIN-1149) When yarn return an incomplete job tracking URL, Kylin will fail to get job status

    [ https://issues.apache.org/jira/browse/KYLIN-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15023840#comment-15023840 ] 

Shaofeng SHI commented on KYLIN-1149:
-------------------------------------

RM can't find the active RM, the host will be empty. This will happen when 2 RM are both standby, or RPC timeout when finding active RM. See more in https://github.com/apache/hadoop/blob/branch-2.4.1/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/RMWebAppFilter.java 

As a hot-fix, Kylin will retry when geting such an incomplete URL.

> When yarn return an incomplete job tracking URL, Kylin will fail to get job status
> ----------------------------------------------------------------------------------
>
>                 Key: KYLIN-1149
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1149
>             Project: Kylin
>          Issue Type: Bug
>          Components: Job Engine
>    Affects Versions: v1.1, v1.0, v0.7.2, v0.7.1, v1.1.1
>            Reporter: Shaofeng SHI
>            Assignee: Shaofeng SHI
>             Fix For: v2.0, 1.2
>
>
> When yarn return an incomplete job tracking URL, Kylin will fail to get job status, then the cube job will be in "Error" state:
> {code}
> [pool-9-thread-8]:[2015-11-16 04:58:32,062][ERROR][org.apache.kylin.engine.mr.common.HadoopStatusChecker.checkStatus(HadoopStatusChecker.java:91)] - error check status
> java.lang.IllegalArgumentException: host parameter is null
>     at org.apache.commons.httpclient.HttpConnection.setHost(HttpConnection.java:249)
>     at org.apache.commons.httpclient.SimpleHttpConnectionManager.getConnectionWithTimeout(SimpleHttpConnectionManager.java:189)
>     at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:153)
>     at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
>     at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
>     at org.apache.kylin.engine.mr.common.HadoopStatusGetter.getHttpResponse(HadoopStatusGetter.java:77)
>     at org.apache.kylin.engine.mr.common.HadoopStatusGetter.get(HadoopStatusGetter.java:54)
>     at org.apache.kylin.engine.mr.common.HadoopStatusChecker.checkStatus(HadoopStatusChecker.java:56)
>     at org.apache.kylin.engine.mr.common.MapReduceExecutable.doWork(MapReduceExecutable.java:136)
>     at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
>     at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
>     at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:107)
>     at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:124)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>     at java.lang.Thread.run(Thread.java:745)
> {code}
> Kylin should better check whether the URL is valid before visit it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)