Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2014/11/25 12:48:12 UTC

[jira] [Commented] (SPARK-2192) Examples Data Not in Binary Distribution

    [ https://issues.apache.org/jira/browse/SPARK-2192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14224436#comment-14224436 ] 

Sean Owen commented on SPARK-2192:
----------------------------------

Data files are now consolidated under "data/", but they are not included in the binary distribution. Adding them would be easy and seems like a reasonable thing to do. However, I'm not sure all of those data files can be redistributed; the MovieLens data, for example, isn't supposed to be, AFAIK. In fact, I'm not sure it should even be in the Spark repo.

Is there any support for me adding the data files to the distro, while removing examples based on data sets like MovieLens that shouldn't be redistributed?

> Examples Data Not in Binary Distribution
> ----------------------------------------
>
>                 Key: SPARK-2192
>                 URL: https://issues.apache.org/jira/browse/SPARK-2192
>             Project: Spark
>          Issue Type: Bug
>          Components: Build
>    Affects Versions: 1.0.0
>            Reporter: Pat McDonough
>
> The data used by the examples is not packaged with the binary distribution. The data subdirectory of Spark should make its way into the distribution somewhere so the examples can use it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
