You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Zhenhua Xu (JIRA)" <ji...@apache.org> on 2016/09/18 07:47:20 UTC
[jira] [Comment Edited] (SPARK-17566) "--master yarn --deploy-mode
cluster" gives "Launching Python applications through spark-submit is
currently only supported for local files"
[ https://issues.apache.org/jira/browse/SPARK-17566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15500502#comment-15500502 ]
Zhenhua Xu edited comment on SPARK-17566 at 9/18/16 7:46 AM:
-------------------------------------------------------------
[~saisai_shao] I did some debugging, the issue seems to be on Line 634 of SparkSubmit.scala. If I change it to
if(isYarnCluster) {
sysProps.get("spark.submit.pyFiles").foreach { pyFiles =>
val resolvedPyFiles = Utils.resolveURIs(pyFiles)
val formattedPyFiles = PythonRunner.formatPaths(resolvedPyFiles).mkString(",")
sysProps("spark.submit.pyFiles") = formattedPyFiles
}
}
Things start to work. Not sure how this affect other submit flow though.
was (Author: zhenhua.xu):
[~saisai_shao] I did some debugging, the issue seems to be on Line 634 of SparkSubmit.scala. If I change it to
if(isYarnCluster) {
sysProps.get("spark.submit.pyFiles").foreach { pyFiles =>
val resolvedPyFiles = Utils.resolveURIs(pyFiles)
val formattedPyFiles = PythonRunner.formatPaths(resolvedPyFiles).mkString(",")
sysProps("spark.submit.pyFiles") = formattedPyFiles
}
}
Things start to work.
> "--master yarn --deploy-mode cluster" gives "Launching Python applications through spark-submit is currently only supported for local files"
> --------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: SPARK-17566
> URL: https://issues.apache.org/jira/browse/SPARK-17566
> Project: Spark
> Issue Type: Bug
> Components: Spark Submit
> Affects Versions: 2.0.0
> Reporter: Zhenhua Xu
>
> In Spark 1.6, the following command runs fine with both primary and additional python files in hdfs.
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn-cluster hdfs:///tmp/pi.py
> In Spark 2.0.0, the following command fails:
> /bin/spark-submit --py-files hdfs:///tmp/base.py --master yarn --deploy-mode cluster hdfs:///tmp/pi.py
> Error:
> Launching Python applications through spark-submit is currently only supported for local files: hdfs:///tmp/base.py
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org