You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Josh Rosen (JIRA)" <ji...@apache.org> on 2016/01/31 00:52:39 UTC
[jira] [Resolved] (SPARK-6029) Unshaded "clearspring" classpath
leakage + excluded fastutil interferes with apps using clearspring
[ https://issues.apache.org/jira/browse/SPARK-6029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Josh Rosen resolved SPARK-6029.
-------------------------------
Resolution: Incomplete
Resolving as "incomplete", since it's not clear whether this issue is still valid. If it is, please comment and we can re-open and re-scope. Thanks!
> Unshaded "clearspring" classpath leakage + excluded fastutil interferes with apps using clearspring
> ----------------------------------------------------------------------------------------------------
>
> Key: SPARK-6029
> URL: https://issues.apache.org/jira/browse/SPARK-6029
> Project: Spark
> Issue Type: Bug
> Components: Build
> Affects Versions: 1.2.1
> Reporter: Jim Kleckner
> Priority: Minor
>
> Spark includes the clearspring analytics package but intentionally excludes the dependencies of the fastutil package.
> Spark includes parquet-column which includes fastutil and relocates it under parquet/ but creates a shaded jar file which is incomplete because it shades out some of the fastutil classes, notably Long2LongOpenHashMap, which is present in the fastutil jar file that parquet-column is referencing.
> We are using more of the clearspring classes (e.g. QDigest) and those do depend on missing fastutil classes like Long2LongOpenHashMap.
> Even though I add them to our assembly jar file, the class loader finds the spark assembly and we get runtime class loader errors when we try to use it.
> The [documentaion|http://spark.apache.org/docs/1.2.0/configuration.html#runtime-environment] and possibly related issue [SPARK-939|https://issues.apache.org/jira/browse/SPARK-939] suggest arguments that I tried with spark-submit:
> {code}
> --conf spark.driver.userClassPathFirst=true \
> --conf spark.executor.userClassPathFirst=true
> {code}
> but we still get the class not found error.
> Could this be a bug with {{userClassPathFirst=true}}? i.e. should it work?
> In any case, would it be reasonable to not exclude the "fastutil" dependencies?
> See email discussion [here|http://apache-spark-user-list.1001560.n3.nabble.com/Fwd-Spark-excludes-quot-fastutil-quot-dependencies-we-need-tt21812.html]
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org