You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by tzolov <gi...@git.apache.org> on 2014/09/21 12:22:05 UTC

[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

GitHub user tzolov opened a pull request:

    https://github.com/apache/spark/pull/2477

    [Build] SPARK-3624: Failed to find Spark assembly in /usr/share/spark/lib...

    Define a 'lib' symlink like this:
    
        lib -> /usr/share/spark/jars
    
    This required jdeb maven plugin update from version 0.11 to 1.3

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/tzolov/spark SPARK-3624

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/2477.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2477
    
----
commit 24a9bd5b51c26bf2225be7d898bba2c17dc4b15c
Author: tzolov <ch...@gmail.com>
Date:   2014-09-21T10:11:59Z

    [Build] SPARK-3624:Failed to find Spark assembly in /usr/share/spark/lib for RELEASED debian packages

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by markgrover <gi...@git.apache.org>.
Github user markgrover commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-58454665
  
    Thanks for including me, @JoshRosen!
    
    I agree with @markhamstra here. 
    
    If I *had* to make this work, I wouldn't do the symlink. In fact, I'd update compute-classpath.sh, to, in case of a RELEASE, look under $FWDIR/lib AND $FWDIR/jars for the assembly jar. Symlink may cause some pain down the road, so I'd prefer a backwards compatible and simple change in the compute-classpath script.
    I am not an expert on how the jdeb plugin is being used in Spark, but I downloaded the latest binary tarball of spark and there is a "lib" directory there. I can see jdeb taking the contents of lib directory and essentially putting them under jars directory but I think it's a slippery slope to create the symlink. If somehow a lib directory shows up under /usr/share/spark/, this symlink creation would fail.
    
    However, I also want to take this opportunity to raise a meta point. Hadoop, and other projects when they were just starting, took the route of bundling their own rpm/deb packaging code along with the project. They soon realized that it was very hard to maintain that since the bread and butter of the project is writing good, solid distributed software and not figuring out why /var/run is a tempfs on latest Operating Systems. This is where Apache Bigtop was born out of - to house the packaging for all projects of Hadoop ecosystem and to integrate them with each other. Consequently, Apache Bigtop packaging is considered the de-facto packaging for Hadoop ecosystem projects. And, Spark is a part of it, has been for quite a while. So, while it's your call at the end of the day, I'd strongly encourage you to separate the packaging concern to Apache Bigtop since I think it's unreasonable to expect a development project to accommodate the needs of all Operating Systems and packaging stan
 dards.
    
    In interest of full-disclosure, I am a committer on the Apache Bigtop project.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by markhamstra <gi...@git.apache.org>.
Github user markhamstra commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-58293963
  
    This certainly works, but I'm not sure that we need to maintain the complexity of having both a `jars` directory and a `lib` symlink to it.  What we want at a minimum is for the unmodified scripts distributed with Spark (such as compute-classpath.sh) to work correctly in the package, so that looks like we now must have the assembly in `lib`.  The only reason to retain `jars` is for backward compatibility and not to break any existing users' scripts that expect the assembly in `jars`.  I'd rather not have both `lib` and `jars`, the Debian packaging has never really been a supported part of Spark, and anybody using the package with an installation management system (all that the package was every really intended for) is already going to be knee-deep in making changes to various configuration files and scripts, so I don't see the burden of having just `lib` and not `jars` as too onerous.
    
    In the end, it's a judgment call.  @pwendell or @mateiz ?  


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-62550983
  
    Sounds like this is a "WontFix"?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-73849000
  
    Per SPARK-5727, I believe the outstanding Debian issues should be closed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by tzolov <gi...@git.apache.org>.
Github user tzolov closed the pull request at:

    https://github.com/apache/spark/pull/2477


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by markhamstra <gi...@git.apache.org>.
Github user markhamstra commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-62570870
  
    No, we should definitely do something to make the Debian package functional again.  The question is whether we are concerned enough about backward compatibility to adopt the symlink approach of this PR or whether we just change `<prefix>${deb.install.path}/jars</prefix>` to `<prefix>${deb.install.path}/lib</prefix>`.
    
    The only real alternative to either of those approaches is to eliminate the Debian packaging entirely.  In any event, we shouldn't leave broken packaging in place. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-56294819
  
    Can one of the admins verify this patch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-77875745
  
    Mind closing this PR?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by JoshRosen <gi...@git.apache.org>.
Github user JoshRosen commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-58291272
  
    @markhamstra or @markgrover, could you take a look at this PR?  I don't know anything about Spark's Debian packaging, but it looks like both of you have contributed changes to it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [Build] SPARK-3624: Failed to find Spark assem...

Posted by tzolov <gi...@git.apache.org>.
Github user tzolov commented on the pull request:

    https://github.com/apache/spark/pull/2477#issuecomment-77895022
  
    i'm closing this PR as this functionality is deprecated. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org