You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Sahil Takiar (JIRA)" <ji...@apache.org> on 2017/04/19 22:22:41 UTC
[jira] [Created] (HIVE-16484) Investigate SparkLauncher for HoS as
alternative to bin/spark-submit
Sahil Takiar created HIVE-16484:
-----------------------------------
Summary: Investigate SparkLauncher for HoS as alternative to bin/spark-submit
Key: HIVE-16484
URL: https://issues.apache.org/jira/browse/HIVE-16484
Project: Hive
Issue Type: Bug
Components: Spark
Reporter: Sahil Takiar
Assignee: Sahil Takiar
The {{SparkClientImpl#startDriver}} currently looks for the {{SPARK_HOME}} directory and invokes the {{bin/spark-submit}} script, which spawns a separate process to run the Spark application.
{{SparkLauncher}} was added in SPARK-4924 and is a programatic way to launch Spark applications.
I see a few advantages:
* No need to spawn a separate process to launch a HoS --> lower startup time
* Simplifies the code in {{SparkClientImpl}} --> easier to debug
* {{SparkLauncher#startApplication}} returns a {{SparkAppHandle}} which contains some useful utilities for querying the state of the Spark job
** It also allows the launcher to specify a list of job listeners
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)