You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Stephan Ewen (JIRA)" <ji...@apache.org> on 2015/02/15 22:47:11 UTC

[jira] [Commented] (FLINK-1454) CliFrontend blocks for 100 seconds when submitting to a non-existent JobManager

    [ https://issues.apache.org/jira/browse/FLINK-1454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14322206#comment-14322206 ] 

Stephan Ewen commented on FLINK-1454:
-------------------------------------

I have been looking into various ways to solve this, such as monitoring of the actor system connection events. It seems that approach does not work properly after all.

How about we define a separate (rather low) timeout for actor lookups?

I'll also write on the akka mailing list for another solution, but so far, the search was fruitless.

> CliFrontend blocks for 100 seconds when submitting to a non-existent JobManager
> -------------------------------------------------------------------------------
>
>                 Key: FLINK-1454
>                 URL: https://issues.apache.org/jira/browse/FLINK-1454
>             Project: Flink
>          Issue Type: Bug
>          Components: JobManager
>    Affects Versions: 0.9
>            Reporter: Robert Metzger
>
> When a user tries to submit a job to a job manager which doesn't exist at all, the CliFrontend blocks for 100 seconds.
> Ideally, Akka would fail because it can not connect to the given hostname:port.
>  
> {code}
> ./bin/flink run ./examples/flink-java-examples-0.9-SNAPSHOT-WordCount.jar -c foo.Baz
> org.apache.flink.client.program.ProgramInvocationException: The main method caused an error.
> 	at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:449)
> 	at org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:350)
> 	at org.apache.flink.client.program.Client.run(Client.java:242)
> 	at org.apache.flink.client.CliFrontend.executeProgram(CliFrontend.java:389)
> 	at org.apache.flink.client.CliFrontend.run(CliFrontend.java:362)
> 	at org.apache.flink.client.CliFrontend.parseParameters(CliFrontend.java:1078)
> 	at org.apache.flink.client.CliFrontend.main(CliFrontend.java:1102)
> Caused by: java.util.concurrent.TimeoutException: Futures timed out after [100 seconds]
> 	at scala.concurrent.impl.Promise$DefaultPromise.ready(Promise.scala:219)
> 	at scala.concurrent.impl.Promise$DefaultPromise.result(Promise.scala:223)
> 	at scala.concurrent.Await$$anonfun$result$1.apply(package.scala:107)
> 	at scala.concurrent.BlockContext$DefaultBlockContext$.blockOn(BlockContext.scala:53)
> 	at scala.concurrent.Await$.result(package.scala:107)
> 	at org.apache.flink.runtime.akka.AkkaUtils$.ask(AkkaUtils.scala:265)
> 	at org.apache.flink.runtime.client.JobClient$.uploadJarFiles(JobClient.scala:169)
> 	at org.apache.flink.runtime.client.JobClient.uploadJarFiles(JobClient.scala)
> 	at org.apache.flink.client.program.Client.run(Client.java:314)
> 	at org.apache.flink.client.program.Client.run(Client.java:296)
> 	at org.apache.flink.client.program.Client.run(Client.java:290)
> 	at org.apache.flink.client.program.ContextEnvironment.execute(ContextEnvironment.java:55)
> 	at org.apache.flink.examples.java.wordcount.WordCount.main(WordCount.java:82)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:606)
> 	at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:434)
> 	... 6 more
> The exception above occurred while trying to run your command.
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)