Posted to issues@spark.apache.org by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2016/04/20 05:15:25 UTC

[jira] [Created] (SPARK-14744) Put examples packaging on a diet

Marcelo Vanzin created SPARK-14744:
--------------------------------------

             Summary: Put examples packaging on a diet
                 Key: SPARK-14744
                 URL: https://issues.apache.org/jira/browse/SPARK-14744
             Project: Spark
          Issue Type: Improvement
          Components: Examples
    Affects Versions: 2.0.0
            Reporter: Marcelo Vanzin
            Priority: Minor


Currently the examples bring in a lot of external dependencies, ballooning the size of the Spark distribution packages.

I'd like to propose two things to slim down these dependencies:

- Make all non-Spark dependencies, and also the Spark Streaming dependencies, "provided". Especially for the streaming connectors, this means that launching an example becomes more like launching a real application: you have to work out how to supply those dependencies yourself, e.g. using {{--packages}}.

- Audit the examples and remove those that add little value. For example, HBase is working on full-featured Spark bindings, based on code that was already in use for a while before being merged into HBase. By comparison, the HBase example in Spark is very bare-bones, not really useful, and in fact a little misleading.
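
To illustrate the first proposal, here is a minimal sketch of what launching an example could look like once its connector dependencies are "provided" and have to be supplied at launch time. It only assembles a {{spark-submit}} command line using the existing {{--class}} and {{--packages}} options; the class name, jar path, and Maven coordinates are hypothetical placeholders, not real artifacts.

```python
import shlex


def build_submit_command(main_class, examples_jar, packages, app_args=()):
    """Assemble a spark-submit argv for an example with 'provided' dependencies.

    `packages` is a list of Maven coordinates (groupId:artifactId:version)
    that spark-submit should resolve and add to the classpath at launch.
    """
    cmd = ["spark-submit", "--class", main_class]
    if packages:
        # --packages expects one comma-separated list of coordinates.
        cmd += ["--packages", ",".join(packages)]
    # Options must come before the application jar; anything after the jar
    # is passed through to the example's main method.
    cmd.append(examples_jar)
    cmd.extend(app_args)
    return cmd


# All values below are hypothetical placeholders used for illustration only.
cmd = build_submit_command(
    main_class="org.example.HypotheticalStreamingExample",
    examples_jar="path/to/spark-examples.jar",
    packages=["org.example:hypothetical-connector_2.11:1.0.0"],
    app_args=["some-topic"],
)

# Print a shell-safe rendering of the command without running it.
print(" ".join(shlex.quote(part) for part in cmd))
```

An example with no external connector needs no {{--packages}} at all, which is why the sketch leaves the option out when the list is empty.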




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
