Posted to issues@livy.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2018/02/07 18:59:00 UTC
[jira] [Commented] (LIVY-372) Livy Doesn't work with Spark on Mesos in cluster mode
[ https://issues.apache.org/jira/browse/LIVY-372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16355887#comment-16355887 ]
Lewis John McGibbney commented on LIVY-372:
-------------------------------------------
[~ajbozarth] can you provide a pointer as to where to start here? Below I describe my progress so far.
# The primary property one would set is _livy.spark.master_, which is defined in [LivyConf.scala|https://github.com/apache/incubator-livy/blob/ef5dccbb0d15797a4bc009b78c88e3b40ff79b7d/server/src/main/scala/org/apache/livy/LivyConf.scala#L45].
# The [Spark Master|https://github.com/apache/incubator-livy/blob/ef5dccbb0d15797a4bc009b78c88e3b40ff79b7d/server/src/main/scala/org/apache/livy/LivyConf.scala#L259] value is then obtained from this property.
# [This value is then used|https://github.com/apache/incubator-livy/blob/ef5dccbb0d15797a4bc009b78c88e3b40ff79b7d/server/src/main/scala/org/apache/livy/LivyConf.scala#L249-L250] by the _isRunningOnYarn_ check, which is called from [LivyServer.scala and SparkApp.scala|https://github.com/apache/incubator-livy/search?utf8=%E2%9C%93&q=isRunningOnYarn&type=].
My thinking is that we would start by looking into initializing the [SparkMesosApp.mesosClient|https://github.com/apache/incubator-livy/blob/75902ebf16bbe4eb03760023e35f02082747064c/server/src/main/scala/org/apache/livy/server/LivyServer.scala#L122-L126] in a way similar to the existing logic.
Any thoughts, [~ajbozarth]? Thank you in advance.
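
For readers following the steps above, here is a minimal sketch of the kind of master-string check being discussed. It is not Livy code: Livy itself is written in Scala, and Python is used only to keep the illustration short. The names parse_conf, spark_master and is_running_on_mesos are hypothetical. The "local" default and the prefix test for YARN are assumptions about how the linked LivyConf.scala behaves, not something confirmed in this thread.

```python
def parse_conf(text):
    """Parse key=value lines (as in the livy.conf excerpt quoted below) into a dict.

    Blank lines and lines starting with '#' are skipped.
    """
    conf = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:  # ignore lines that have no '=' at all
            conf[key.strip()] = value.strip()
    return conf


def spark_master(conf):
    # Assumption: fall back to "local" when livy.spark.master is not set.
    return conf.get("livy.spark.master", "local")


def is_running_on_yarn(conf):
    # Assumption: mirrors the existing isRunningOnYarn check as a simple prefix test.
    return spark_master(conf).startswith("yarn")


def is_running_on_mesos(conf):
    # Hypothetical counterpart for Mesos; Spark master URLs for Mesos start with "mesos://".
    return spark_master(conf).startswith("mesos://")


if __name__ == "__main__":
    conf = parse_conf(
        "livy.spark.master=mesos://node1:5050\n"
        "livy.spark.deployMode=cluster\n"
    )
    print(spark_master(conf), is_running_on_yarn(conf), is_running_on_mesos(conf))
```

A check like this only tells the server which cluster manager is in use. The harder part raised in the comment, creating and using a Mesos client the way the YARN client is set up today, is not covered by this sketch.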
> Livy Doesn't work with Spark on Mesos in cluster mode
> -----------------------------------------------------
>
> Key: LIVY-372
> URL: https://issues.apache.org/jira/browse/LIVY-372
> Project: Livy
> Issue Type: Bug
> Components: Core
> Affects Versions: 0.3
> Environment: Spark-2.1.0, Mesos-1.2.0
> Reporter: Yehor Krivokon
> Priority: Major
>
> Client mode - it works.
> But it doesn't work with cluster mode:
> Added to livy.conf:
> livy.spark.master=mesos://node1:5050
> livy.spark.deployMode=cluster
> Added to hue.ini:
> [spark]
> livy_server_host=node1
> livy_server_port=8998
> Livy logs:
> 17/06/16 21:00:26 INFO StateStore$: Using BlackholeStateStore for recovery.
> 17/06/16 21:00:26 INFO BatchSessionManager: Recovered 0 batch sessions. Next session id: 0
> 17/06/16 21:00:26 INFO InteractiveSessionManager: Recovered 0 interactive sessions. Next session id: 0
> 17/06/16 21:00:26 INFO InteractiveSessionManager: Heartbeat watchdog thread started.
> 17/06/16 21:00:26 INFO WebServer: Starting server on http://node1:8998
> 17/06/16 22:04:04 INFO LivyServer: Shutting down Livy server.
> 17/06/16 22:04:07 INFO StateStore$: Using BlackholeStateStore for recovery.
> 17/06/16 22:04:07 INFO BatchSessionManager: Recovered 0 batch sessions. Next session id: 0
> 17/06/16 22:04:07 INFO InteractiveSessionManager: Recovered 0 interactive sessions. Next session id: 0
> 17/06/16 22:04:07 INFO InteractiveSessionManager: Heartbeat watchdog thread started.
> 17/06/16 22:04:07 INFO WebServer: Starting server on http://node1:8998
> 17/06/16 22:05:07 INFO InteractiveSession$: Creating LivyClient for sessionId: 0
> 17/06/16 22:05:07 WARN RSCConf: Your hostname, node1, resolves to a loopback address, but we couldn't find any external IP addre$
> 17/06/16 22:05:07 WARN RSCConf: Set livy.rsc.rpc.server.address if you need to bind to another address.
> 17/06/16 22:05:10 WARN ContextLauncher: Child process exited with code 1.
> 17/06/16 22:05:10 ERROR RSCClient: Failed to connect to context.
> java.io.IOException: Child process exited with code 1.
> at com.cloudera.livy.rsc.ContextLauncher$ChildProcess$1.run(ContextLauncher.java:416)
> at com.cloudera.livy.rsc.ContextLauncher$ChildProcess$2.run(ContextLauncher.java:490)
> at java.lang.Thread.run(Thread.java:748)
> 17/06/16 22:05:10 INFO RSCClient: Failing pending job 8a53d717-773c-434e-807f-3ceb46138a66 due to shutdown.
> 17/06/16 22:05:10 INFO InteractiveSession: Stopping InteractiveSession 0...
> 17/06/16 22:05:10 INFO InteractiveSession: Stopped InteractiveSession 0.
> 17/06/16 22:05:10 WARN InteractiveSession: (Fail to get rsc uri,java.util.concurrent.ExecutionException: java.io.IOException: Ch$
> 17/06/16 22:05:12 INFO InteractiveSession: Stopping InteractiveSession 0...
> 17/06/16 22:05:12 INFO InteractiveSession: Stopped InteractiveSession 0.
> 17/06/16 22:12:45 WARN HttpParser: Illegal character 0x16 in state=START for buffer HeapByteBuffer@5bf799a9[p=1,l=171,c=16384,r=$
> 17/06/16 22:12:45 WARN HttpParser: badMessage: 400 Illegal character 0x16 for HttpChannelOverHttp@10e07e9e{r=0,c=false,a=IDLE,ur$
> 17/06/16 22:58:15 WARN HttpParser: Illegal character 0x16 in state=START for buffer HeapByteBuffer@5bf799a9[p=1,l=171,c=16384,r=$
> 17/06/16 22:58:15 WARN HttpParser: badMessage: 400 Illegal character 0x16 for HttpChannelOverHttp@2ef0a110{r=0,c=false,a=ID
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)