Posted to dev@bahir.apache.org by "Steve Loughran (JIRA)" <ji...@apache.org> on 2016/07/20 11:32:20 UTC

[jira] [Commented] (BAHIR-35) Include Python code in the binary jars for use with "--packages ..."

    [ https://issues.apache.org/jira/browse/BAHIR-35?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15385706#comment-15385706 ] 

Steve Loughran commented on BAHIR-35:
-------------------------------------

What's being proposed here? Just adding the Bahir .py files into the binary .jar?

> Include Python code in the binary jars for use with "--packages ..."
> --------------------------------------------------------------------
>
>                 Key: BAHIR-35
>                 URL: https://issues.apache.org/jira/browse/BAHIR-35
>             Project: Bahir
>          Issue Type: Task
>          Components: Build
>    Affects Versions: 2.0.0
>            Reporter: Christian Kadner
>   Original Estimate: 8h
>  Remaining Estimate: 8h
>
> Currently, to make use of the PySpark code (i.e. streaming-mqtt/python), a user has to download the jar from Maven Central or clone the code from GitHub, find the individual *.py files, create a zip, and add that to the {{spark-submit}} command with the {{--py-files}} option, or add them to the {{PYTHONPATH}} when running locally.
> If we include the Python code in the binary build (in the jar that gets uploaded to Maven Central), then users need not do any acrobatics besides using the {{--packages ...}} option.
> An example where the Python code is part of the binary jar is the [GraphFrames|https://spark-packages.org/package/graphframes/graphframes] package.
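
For reference, the manual workaround described in the issue (collect the *.py files, zip them, pass the zip via --py-files) could look roughly like the sketch below. This is only an illustration, not part of the Bahir build: the function name zip_python_sources and the directory layout are assumptions made for the example.

```python
import zipfile
from pathlib import Path


def zip_python_sources(source_root, zip_path):
    """Bundle every *.py file under source_root into zip_path.

    Archive names are kept relative to source_root so that package
    directories stay importable once the zip is on the Python path.
    Hypothetical helper, not something Bahir ships.
    """
    source_root = Path(source_root)
    added = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for py_file in sorted(source_root.rglob("*.py")):
            arcname = py_file.relative_to(source_root).as_posix()
            archive.write(py_file, arcname)
            added.append(arcname)
    return added
```

A user would then call something like zip_python_sources("streaming-mqtt/python", "bahir-python.zip") and pass the result along, e.g. spark-submit --packages ... --py-files bahir-python.zip app.py, where bahir-python.zip and app.py are placeholder names. Shipping the Python code inside the jar would make this step unnecessary.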



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)