You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Ash Berlin-Taylor (JIRA)" <ji...@apache.org> on 2019/07/17 12:11:00 UTC
[jira] [Resolved] (AIRFLOW-4717) The spark_binary arg to
SparkSubmitOperator has no effect when connection exists.
[ https://issues.apache.org/jira/browse/AIRFLOW-4717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ash Berlin-Taylor resolved AIRFLOW-4717.
----------------------------------------
Resolution: Duplicate
> The spark_binary arg to SparkSubmitOperator has no effect when connection exists.
> ---------------------------------------------------------------------------------
>
> Key: AIRFLOW-4717
> URL: https://issues.apache.org/jira/browse/AIRFLOW-4717
> Project: Apache Airflow
> Issue Type: Bug
> Components: hooks
> Affects Versions: 1.10.3
> Reporter: yunsu lee
> Assignee: yunsu lee
> Priority: Major
>
> Apache spark depending on the desto, the spark binary name may be different. (ex. spark2-submit)
> For this reason, the spark_binary option has been added to sparkSubmitOperator
> (Reference : [https://github.com/apache/airflow/pull/4360/files])
>
> However, this option does not work.
> This is because there is logic to hard-code and override the spark-binary option value in spark_submit_hook.py
> (Full path : airflow/contrib/hooks/spark_submit_hook.py)
>
> {code:java}
> ...
> conn_data['spark_binary'] = extra.get('spark-binary', "spark-submit")
> ...{code}
> It is necessary to delete the corresponding line.
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)