You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Thomas Koch (JIRA)" <ji...@apache.org> on 2010/02/25 16:17:28 UTC

[jira] Created: (HADOOP-6597) additional source only release tarball

additional source only release tarball
--------------------------------------

                 Key: HADOOP-6597
                 URL: https://issues.apache.org/jira/browse/HADOOP-6597
             Project: Hadoop Common
          Issue Type: Wish
            Reporter: Thomas Koch
            Priority: Trivial


One common annoyance when packaging java applications for a Free Software distribution is the necessity to repackage the upstream tarball. The repackaging is necessary, because Debian may only distribute binary files build from source that's also available from Debian.
So we build the jar/war files ourselfes to make sure there's nothing we don't have the sources for.
It would take one (annoying and time consuming) step less for packagers, if java upstream projects would release an additional tarball without any binary files or third party code.
I'm asking you first, because many other projects (like zookeeper) took or take hadoop as an example for their build infrastructure.
For your orientation, these are the patterns that I used to filter the hadoop tarball: (Usable with tar --exclude)

"*.jar",
"uming.*",
"prototype.js",
"config.sub",
"config.guess",
"ltmain.sh",
"Makefile.in",
"configure",
"aclocal.m4",
"config.h.in",
"install-sh",
"autom4te.cache",
"depcomp",
"missing",
"pipes/compile",
"src/contrib/eclipse-plugin/resources/*.jpg",
"src/contrib/eclipse-plugin/resources/*.png",
"src/contrib/eclipse-plugin/resources/*.gif",
"hadoop-0.20.1/src/core/org/apache/hadoop/record/compiler/generated/*.java",
"hadoop-0.20.1/src/docs/cn/build",
"hadoop-0.20.1/c++",
"hadoop-0.20.1/contrib",
"hadoop-0.20.1/lib/native",
"hadoop-0.20.1/librecordio",
"hadoop-0.20.1/src/contrib/thriftfs/gen-*",
"hadoop-0.20.1/docs",

There were different reasons why stuff needed to be filtered:
- unclear license (uming.*)
- unclear origin (images in the eclipse plugin)
- precompiled documentation / code / hadoop binaries
- pregenerated C/C++ automake files
- third party libraries (prototype.js, lib/*.jar)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.