Posted to issues@spark.apache.org by "Juliet Hougland (JIRA)" <ji...@apache.org> on 2015/06/25 23:41:06 UTC
[jira] [Created] (SPARK-8646) PySpark does not run on YARN
Juliet Hougland created SPARK-8646:
--------------------------------------
Summary: PySpark does not run on YARN
Key: SPARK-8646
URL: https://issues.apache.org/jira/browse/SPARK-8646
Project: Spark
Issue Type: Bug
Components: PySpark, YARN
Affects Versions: 1.4.0
Environment: SPARK_HOME=local/path/to/spark1.4install/dir
also with
SPARK_HOME=local/path/to/spark1.4install/dir
PYTHONPATH=$SPARK_HOME/python/lib
Spark apps are submitted with the command:
$SPARK_HOME/bin/spark-submit outofstock/data_transform.py hdfs://foe-dev/DEMO_DATA/FACT_POS hdfs:/user/juliet/ex/ yarn-client
data_transform contains a main method, and the rest of the args are parsed in my own code.
Reporter: Juliet Hougland
Running PySpark jobs in yarn-client mode on Spark 1.4 results in a "no module named pyspark" error.
I believe this JIRA represents the change that introduced this error: https://issues.apache.org/jira/browse/SPARK-6869
This does not represent a binary-compatible change to Spark. Scripts that worked on previous Spark versions (i.e. commands that use spark-submit) should continue to work without modification between minor versions.
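For anyone trying to reproduce this, a small diagnostic along the following lines may help narrow down where the import fails. It is only a sketch: the function name pyspark_diagnostics and the report keys are made up for illustration, and it assumes the Python libraries live under $SPARK_HOME/python/lib as in the environment described above. It inspects only the interpreter it runs in, so it says nothing about the Python workers on the YARN nodes.

```python
import glob
import importlib.util
import os


def pyspark_diagnostics(spark_home):
    """Collect facts relevant to a "no module named pyspark" error.

    Hypothetical helper, not part of Spark. It only looks at the local
    filesystem and at the interpreter that is running this script.
    """
    lib_dir = os.path.join(spark_home, "python", "lib")
    return {
        "lib_dir": lib_dir,
        # Zip archives found under python/lib, if any.
        "lib_zips": sorted(glob.glob(os.path.join(lib_dir, "*.zip"))),
        # PYTHONPATH as seen by this interpreter.
        "pythonpath": os.environ.get("PYTHONPATH", ""),
        # True only if this interpreter can locate a top-level pyspark module.
        "pyspark_importable": importlib.util.find_spec("pyspark") is not None,
    }


if __name__ == "__main__":
    report = pyspark_diagnostics(os.environ.get("SPARK_HOME", ""))
    for key, value in report.items():
        print("{}: {}".format(key, value))
```

Running it once with only SPARK_HOME set and once with PYTHONPATH also set, as in the two environments listed above, should show whether the local interpreter sees pyspark in either case.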