Posted to issues@spark.apache.org by "Hyukjin Kwon (Jira)" <ji...@apache.org> on 2020/12/01 04:44:00 UTC

[jira] [Assigned] (SPARK-33530) Support --archives option natively

     [ https://issues.apache.org/jira/browse/SPARK-33530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyukjin Kwon reassigned SPARK-33530:
------------------------------------

    Assignee: Hyukjin Kwon

> Support --archives option natively
> ----------------------------------
>
>                 Key: SPARK-33530
>                 URL: https://issues.apache.org/jira/browse/SPARK-33530
>             Project: Spark
>          Issue Type: Improvement
>          Components: Spark Core
>    Affects Versions: 3.1.0
>            Reporter: Hyukjin Kwon
>            Assignee: Hyukjin Kwon
>            Priority: Major
>
> Currently, {{spark-submit --archives}} and the {{spark.yarn.dist.archives}} configuration are supported only in YARN mode:
> {code}
> spark-submit --help
> ...
>  Spark on YARN only:
>   --queue QUEUE_NAME          The YARN queue to submit to (Default: "default").
>   --archives ARCHIVES         Comma separated list of archives to be extracted into the
>                               working directory of each executor.
> {code}
> This feature is critical for PySpark, which relies on it to ship other packages together with an application; see also https://hyukjin-spark.readthedocs.io/en/stable/user_guide/python_packaging.html#using-zipped-virtual-environment.
> Because the feature is missing, PySpark cannot use a conda environment to ship other packages together with an application.
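
For readers unfamiliar with what "extracted into the working directory of each executor" means in practice, here is a minimal Python sketch of that behaviour. It is not Spark's implementation. The helper names (split_archive_spec, unpack_archives) are hypothetical, and the "archive#alias" form, where the part after "#" names the directory the archive is unpacked into, is an assumption taken from YARN-style usage rather than from this issue.

```python
import shutil
import tarfile
import tempfile
from pathlib import Path
from typing import List, Tuple


def split_archive_spec(spec: str) -> Tuple[str, str]:
    """Split 'env.tar.gz#alias' into (archive path, directory name).

    Assumption for this sketch: without a '#alias' fragment, the
    archive's own file name is used as the directory name.
    """
    path, _, alias = spec.partition("#")
    return path, alias or Path(path).name


def unpack_archives(archives: str, work_dir: Path) -> List[Path]:
    """Unpack a comma-separated list of archive specs under work_dir."""
    unpacked = []
    for spec in archives.split(","):
        spec = spec.strip()
        if not spec:
            continue  # tolerate stray commas
        path, alias = split_archive_spec(spec)
        target = work_dir / alias
        # shutil infers the format (.zip, .tar.gz, ...) from the file name.
        shutil.unpack_archive(path, str(target))
        unpacked.append(target)
    return unpacked


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        # Build a stand-in for a packed environment: one file at bin/python.
        (root / "src" / "bin").mkdir(parents=True)
        (root / "src" / "bin" / "python").write_text("stub interpreter\n")
        archive = root / "pyspark_conda_env.tar.gz"
        with tarfile.open(str(archive), "w:gz") as tar:
            tar.add(str(root / "src" / "bin"), arcname="bin")

        # Stand-in for an executor's working directory.
        work_dir = root / "executor_work_dir"
        work_dir.mkdir()
        for target in unpack_archives(f"{archive}#environment", work_dir):
            print(target.name, sorted(p.name for p in target.rglob("*")))
```

The point of the alias is that the unpacked environment ends up at a predictable relative path (here environment/bin/python), so a job can refer to its interpreter without knowing where the working directory lives on each node.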



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org