You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@hadoop.apache.org by David Parks <da...@yahoo.com> on 2013/03/28 07:07:07 UTC

Which hadoop installation should I use on ubuntu server?

I'm moving off AWS MapReduce to our own cluster, I'm installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if it'll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

Re: Which hadoop installation should I use on ubuntu server?

Posted by Marcos Luis Ortiz Valmaseda <ma...@gmail.com>.

In BigTop´s wiki, you can find this:
https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop+0.5.0#HowtoinstallHadoopdistributionfromBigtop0.5.0-Ubuntu%2864bit%2Clucid%2Cprecise%2Cquantal%29




2013/3/28 Ted Dunning <td...@maprtech.com>

> Also, Canonical just announced that MapR is available in the Partner repos.
>
>
> On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar <ni...@gmail.com>wrote:
>
>> apache bigtop has builds done for ubuntu
>>
>> you can check them at jenkins mentioned on bigtop.apache.org
>>
>>
>> On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:
>>
>>> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop
>>> on Ubuntu Server 12.10.****
>>>
>>> ** **
>>>
>>> I see a .deb installer and installed that, but it seems like files are
>>> all over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`.
>>> And the documentation is a bit harder to follow:****
>>>
>>> ** **
>>>
>>> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>>>
>>> ** **
>>>
>>> So I just wonder if this installer is the best approach, or if it’ll be
>>> easier/better to just install the basic build in /opt/hadoop and perhaps
>>> the docs become easier to follow. Thoughts?****
>>>
>>> ** **
>>>
>>> Thanks,****
>>>
>>> Dave****
>>>
>>> ** **
>>>
>>
>>
>>
>> --
>> Nitin Pawar
>>
>
>


-- 
Marcos Ortiz Valmaseda,
*Data-Driven Product Manager* at PDVSA
*Blog*: http://dataddict.wordpress.com/
*LinkedIn: *http://www.linkedin.com/in/marcosluis2186
*Twitter*: @marcosluis2186 <http://twitter.com/marcosluis2186>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Marcos Luis Ortiz Valmaseda <ma...@gmail.com>.

In BigTop´s wiki, you can find this:
https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop+0.5.0#HowtoinstallHadoopdistributionfromBigtop0.5.0-Ubuntu%2864bit%2Clucid%2Cprecise%2Cquantal%29




2013/3/28 Ted Dunning <td...@maprtech.com>

> Also, Canonical just announced that MapR is available in the Partner repos.
>
>
> On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar <ni...@gmail.com>wrote:
>
>> apache bigtop has builds done for ubuntu
>>
>> you can check them at jenkins mentioned on bigtop.apache.org
>>
>>
>> On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:
>>
>>> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop
>>> on Ubuntu Server 12.10.****
>>>
>>> ** **
>>>
>>> I see a .deb installer and installed that, but it seems like files are
>>> all over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`.
>>> And the documentation is a bit harder to follow:****
>>>
>>> ** **
>>>
>>> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>>>
>>> ** **
>>>
>>> So I just wonder if this installer is the best approach, or if it’ll be
>>> easier/better to just install the basic build in /opt/hadoop and perhaps
>>> the docs become easier to follow. Thoughts?****
>>>
>>> ** **
>>>
>>> Thanks,****
>>>
>>> Dave****
>>>
>>> ** **
>>>
>>
>>
>>
>> --
>> Nitin Pawar
>>
>
>


-- 
Marcos Ortiz Valmaseda,
*Data-Driven Product Manager* at PDVSA
*Blog*: http://dataddict.wordpress.com/
*LinkedIn: *http://www.linkedin.com/in/marcosluis2186
*Twitter*: @marcosluis2186 <http://twitter.com/marcosluis2186>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Marcos Luis Ortiz Valmaseda <ma...@gmail.com>.

In BigTop´s wiki, you can find this:
https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop+0.5.0#HowtoinstallHadoopdistributionfromBigtop0.5.0-Ubuntu%2864bit%2Clucid%2Cprecise%2Cquantal%29




2013/3/28 Ted Dunning <td...@maprtech.com>

> Also, Canonical just announced that MapR is available in the Partner repos.
>
>
> On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar <ni...@gmail.com>wrote:
>
>> apache bigtop has builds done for ubuntu
>>
>> you can check them at jenkins mentioned on bigtop.apache.org
>>
>>
>> On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:
>>
>>> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop
>>> on Ubuntu Server 12.10.****
>>>
>>> ** **
>>>
>>> I see a .deb installer and installed that, but it seems like files are
>>> all over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`.
>>> And the documentation is a bit harder to follow:****
>>>
>>> ** **
>>>
>>> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>>>
>>> ** **
>>>
>>> So I just wonder if this installer is the best approach, or if it’ll be
>>> easier/better to just install the basic build in /opt/hadoop and perhaps
>>> the docs become easier to follow. Thoughts?****
>>>
>>> ** **
>>>
>>> Thanks,****
>>>
>>> Dave****
>>>
>>> ** **
>>>
>>
>>
>>
>> --
>> Nitin Pawar
>>
>
>


-- 
Marcos Ortiz Valmaseda,
*Data-Driven Product Manager* at PDVSA
*Blog*: http://dataddict.wordpress.com/
*LinkedIn: *http://www.linkedin.com/in/marcosluis2186
*Twitter*: @marcosluis2186 <http://twitter.com/marcosluis2186>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Marcos Luis Ortiz Valmaseda <ma...@gmail.com>.

In BigTop´s wiki, you can find this:
https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop+0.5.0#HowtoinstallHadoopdistributionfromBigtop0.5.0-Ubuntu%2864bit%2Clucid%2Cprecise%2Cquantal%29




2013/3/28 Ted Dunning <td...@maprtech.com>

> Also, Canonical just announced that MapR is available in the Partner repos.
>
>
> On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar <ni...@gmail.com>wrote:
>
>> apache bigtop has builds done for ubuntu
>>
>> you can check them at jenkins mentioned on bigtop.apache.org
>>
>>
>> On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:
>>
>>> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop
>>> on Ubuntu Server 12.10.****
>>>
>>> ** **
>>>
>>> I see a .deb installer and installed that, but it seems like files are
>>> all over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`.
>>> And the documentation is a bit harder to follow:****
>>>
>>> ** **
>>>
>>> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>>>
>>> ** **
>>>
>>> So I just wonder if this installer is the best approach, or if it’ll be
>>> easier/better to just install the basic build in /opt/hadoop and perhaps
>>> the docs become easier to follow. Thoughts?****
>>>
>>> ** **
>>>
>>> Thanks,****
>>>
>>> Dave****
>>>
>>> ** **
>>>
>>
>>
>>
>> --
>> Nitin Pawar
>>
>
>


-- 
Marcos Ortiz Valmaseda,
*Data-Driven Product Manager* at PDVSA
*Blog*: http://dataddict.wordpress.com/
*LinkedIn: *http://www.linkedin.com/in/marcosluis2186
*Twitter*: @marcosluis2186 <http://twitter.com/marcosluis2186>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Ted Dunning <td...@maprtech.com>.

Also, Canonical just announced that MapR is available in the Partner repos.


On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar <ni...@gmail.com>wrote:

> apache bigtop has builds done for ubuntu
>
> you can check them at jenkins mentioned on bigtop.apache.org
>
>
> On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:
>
>> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
>> Ubuntu Server 12.10.****
>>
>> ** **
>>
>> I see a .deb installer and installed that, but it seems like files are
>> all over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`.
>> And the documentation is a bit harder to follow:****
>>
>> ** **
>>
>> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>>
>> ** **
>>
>> So I just wonder if this installer is the best approach, or if it’ll be
>> easier/better to just install the basic build in /opt/hadoop and perhaps
>> the docs become easier to follow. Thoughts?****
>>
>> ** **
>>
>> Thanks,****
>>
>> Dave****
>>
>> ** **
>>
>
>
>
> --
> Nitin Pawar
>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Bertrand Dechoux <de...@gmail.com>.

For information, the 50 node limit on CDH is a past limitation. It is no
longer the case.

*Support for unlimited nodes*. Previous versions of Cloudera Manager Free
> Edition limited the number of managed nodes to 50. This limitation has been
> removed.
>

https://ccp.cloudera.com/display/FREE45DOC/New+Features+in+Cloudera+Manager+Free+Edition+4

Bertrand

On Fri, Mar 29, 2013 at 10:59 AM, Bruno Mahé <bm...@apache.org> wrote:

> On 03/29/2013 01:09 AM, David Parks wrote:
>
>> Hmm, seems intriguing. I’m still not totally clear on bigtop here. It
>> seems like they’re creating and maintain basically an installer for
>> Hadoop?
>>
>> I tried following their docs for Ubuntu, but just get a 404 error on the
>> first step, so it makes me wonder how reliable that project is.
>>
>> https://cwiki.apache.org/**confluence/display/BIGTOP/How+**
>> to+install+Hadoop+**distribution+from+Bigtop<https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop>
>>
>> Has anyone actually used bigtop to deploy Hadoop in a production
>> environment?
>>
>>
>
> Hi David,
>
> You may want to send an email to Apache Bigtop mailing lists with your
> questions, but the goal of Apache Bigtop is three folds:
> 1/ Provide top notch packages for Apache Hadoop related projects for the
> most popular GNU/Linux distributions
> 2/ Provide a point of integration and testing for all these projects
> 3/ Provide means to reliably deploy a complete stack.
>
> Apache Bigtop was donated by Cloudera to the Apache Foundation and is
> now used as the base for CDH (Cloudera's distribution).
>
> All the projects supported by Apache Bigtop have packages for
> Debian/Ubuntu/SLES11/Fedora/**CentOS.
> They also have tests to exercise integration points between all of them
> (ex: Hive can use HBase which sits on top of HDFS). And in order to run
> these tests, we also have a test framework.
> Also before we can test for integration, we also have to ensure they can
> be properly installed/upgraded/removed, with the right users, ulimits,
> rights and so forth. So to that end, we also have a large chunk of the
> tests and testing framework dedicated to testing the packages themselves.
>
> And finally, regarding the deployment, we have the following:
> * Boxgrinder recipe people can use and modify to suit their need. This is
> useful if you want to create your own virtual machine (kvm, vmware,
> virtualbox) but also want to create images for ec2.
> * kickstart file to build a live fedora image
> * puppet recipes to deploy all these services.
>
>
> Regarding your issue with the instructions on the wiki, this is because
> they have not been updated since Apache Bigtop became a top level project,
> and the location of the artefacts has changed. When reading the
> instructions, please use the following as the base url:
> http://www.apache.org/dist/**bigtop/bigtop-0.5.0/<http://www.apache.org/dist/bigtop/bigtop-0.5.0/>
>
> Please, let me know if you still have some issues.
>
> Thanks,
> Bruno
>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Bertrand Dechoux <de...@gmail.com>.

For information, the 50 node limit on CDH is a past limitation. It is no
longer the case.

*Support for unlimited nodes*. Previous versions of Cloudera Manager Free
> Edition limited the number of managed nodes to 50. This limitation has been
> removed.
>

https://ccp.cloudera.com/display/FREE45DOC/New+Features+in+Cloudera+Manager+Free+Edition+4

Bertrand

On Fri, Mar 29, 2013 at 10:59 AM, Bruno Mahé <bm...@apache.org> wrote:

> On 03/29/2013 01:09 AM, David Parks wrote:
>
>> Hmm, seems intriguing. I’m still not totally clear on bigtop here. It
>> seems like they’re creating and maintain basically an installer for
>> Hadoop?
>>
>> I tried following their docs for Ubuntu, but just get a 404 error on the
>> first step, so it makes me wonder how reliable that project is.
>>
>> https://cwiki.apache.org/**confluence/display/BIGTOP/How+**
>> to+install+Hadoop+**distribution+from+Bigtop<https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop>
>>
>> Has anyone actually used bigtop to deploy Hadoop in a production
>> environment?
>>
>>
>
> Hi David,
>
> You may want to send an email to Apache Bigtop mailing lists with your
> questions, but the goal of Apache Bigtop is three folds:
> 1/ Provide top notch packages for Apache Hadoop related projects for the
> most popular GNU/Linux distributions
> 2/ Provide a point of integration and testing for all these projects
> 3/ Provide means to reliably deploy a complete stack.
>
> Apache Bigtop was donated by Cloudera to the Apache Foundation and is
> now used as the base for CDH (Cloudera's distribution).
>
> All the projects supported by Apache Bigtop have packages for
> Debian/Ubuntu/SLES11/Fedora/**CentOS.
> They also have tests to exercise integration points between all of them
> (ex: Hive can use HBase which sits on top of HDFS). And in order to run
> these tests, we also have a test framework.
> Also before we can test for integration, we also have to ensure they can
> be properly installed/upgraded/removed, with the right users, ulimits,
> rights and so forth. So to that end, we also have a large chunk of the
> tests and testing framework dedicated to testing the packages themselves.
>
> And finally, regarding the deployment, we have the following:
> * Boxgrinder recipe people can use and modify to suit their need. This is
> useful if you want to create your own virtual machine (kvm, vmware,
> virtualbox) but also want to create images for ec2.
> * kickstart file to build a live fedora image
> * puppet recipes to deploy all these services.
>
>
> Regarding your issue with the instructions on the wiki, this is because
> they have not been updated since Apache Bigtop became a top level project,
> and the location of the artefacts has changed. When reading the
> instructions, please use the following as the base url:
> http://www.apache.org/dist/**bigtop/bigtop-0.5.0/<http://www.apache.org/dist/bigtop/bigtop-0.5.0/>
>
> Please, let me know if you still have some issues.
>
> Thanks,
> Bruno
>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Bertrand Dechoux <de...@gmail.com>.

For information, the 50 node limit on CDH is a past limitation. It is no
longer the case.

*Support for unlimited nodes*. Previous versions of Cloudera Manager Free
> Edition limited the number of managed nodes to 50. This limitation has been
> removed.
>

https://ccp.cloudera.com/display/FREE45DOC/New+Features+in+Cloudera+Manager+Free+Edition+4

Bertrand

On Fri, Mar 29, 2013 at 10:59 AM, Bruno Mahé <bm...@apache.org> wrote:

> On 03/29/2013 01:09 AM, David Parks wrote:
>
>> Hmm, seems intriguing. I’m still not totally clear on bigtop here. It
>> seems like they’re creating and maintain basically an installer for
>> Hadoop?
>>
>> I tried following their docs for Ubuntu, but just get a 404 error on the
>> first step, so it makes me wonder how reliable that project is.
>>
>> https://cwiki.apache.org/**confluence/display/BIGTOP/How+**
>> to+install+Hadoop+**distribution+from+Bigtop<https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop>
>>
>> Has anyone actually used bigtop to deploy Hadoop in a production
>> environment?
>>
>>
>
> Hi David,
>
> You may want to send an email to Apache Bigtop mailing lists with your
> questions, but the goal of Apache Bigtop is three folds:
> 1/ Provide top notch packages for Apache Hadoop related projects for the
> most popular GNU/Linux distributions
> 2/ Provide a point of integration and testing for all these projects
> 3/ Provide means to reliably deploy a complete stack.
>
> Apache Bigtop was donated by Cloudera to the Apache Foundation and is
> now used as the base for CDH (Cloudera's distribution).
>
> All the projects supported by Apache Bigtop have packages for
> Debian/Ubuntu/SLES11/Fedora/**CentOS.
> They also have tests to exercise integration points between all of them
> (ex: Hive can use HBase which sits on top of HDFS). And in order to run
> these tests, we also have a test framework.
> Also before we can test for integration, we also have to ensure they can
> be properly installed/upgraded/removed, with the right users, ulimits,
> rights and so forth. So to that end, we also have a large chunk of the
> tests and testing framework dedicated to testing the packages themselves.
>
> And finally, regarding the deployment, we have the following:
> * Boxgrinder recipe people can use and modify to suit their need. This is
> useful if you want to create your own virtual machine (kvm, vmware,
> virtualbox) but also want to create images for ec2.
> * kickstart file to build a live fedora image
> * puppet recipes to deploy all these services.
>
>
> Regarding your issue with the instructions on the wiki, this is because
> they have not been updated since Apache Bigtop became a top level project,
> and the location of the artefacts has changed. When reading the
> instructions, please use the following as the base url:
> http://www.apache.org/dist/**bigtop/bigtop-0.5.0/<http://www.apache.org/dist/bigtop/bigtop-0.5.0/>
>
> Please, let me know if you still have some issues.
>
> Thanks,
> Bruno
>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Bertrand Dechoux <de...@gmail.com>.

For information, the 50 node limit on CDH is a past limitation. It is no
longer the case.

*Support for unlimited nodes*. Previous versions of Cloudera Manager Free
> Edition limited the number of managed nodes to 50. This limitation has been
> removed.
>

https://ccp.cloudera.com/display/FREE45DOC/New+Features+in+Cloudera+Manager+Free+Edition+4

Bertrand

On Fri, Mar 29, 2013 at 10:59 AM, Bruno Mahé <bm...@apache.org> wrote:

> On 03/29/2013 01:09 AM, David Parks wrote:
>
>> Hmm, seems intriguing. I’m still not totally clear on bigtop here. It
>> seems like they’re creating and maintain basically an installer for
>> Hadoop?
>>
>> I tried following their docs for Ubuntu, but just get a 404 error on the
>> first step, so it makes me wonder how reliable that project is.
>>
>> https://cwiki.apache.org/**confluence/display/BIGTOP/How+**
>> to+install+Hadoop+**distribution+from+Bigtop<https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop>
>>
>> Has anyone actually used bigtop to deploy Hadoop in a production
>> environment?
>>
>>
>
> Hi David,
>
> You may want to send an email to Apache Bigtop mailing lists with your
> questions, but the goal of Apache Bigtop is three folds:
> 1/ Provide top notch packages for Apache Hadoop related projects for the
> most popular GNU/Linux distributions
> 2/ Provide a point of integration and testing for all these projects
> 3/ Provide means to reliably deploy a complete stack.
>
> Apache Bigtop was donated by Cloudera to the Apache Foundation and is
> now used as the base for CDH (Cloudera's distribution).
>
> All the projects supported by Apache Bigtop have packages for
> Debian/Ubuntu/SLES11/Fedora/**CentOS.
> They also have tests to exercise integration points between all of them
> (ex: Hive can use HBase which sits on top of HDFS). And in order to run
> these tests, we also have a test framework.
> Also before we can test for integration, we also have to ensure they can
> be properly installed/upgraded/removed, with the right users, ulimits,
> rights and so forth. So to that end, we also have a large chunk of the
> tests and testing framework dedicated to testing the packages themselves.
>
> And finally, regarding the deployment, we have the following:
> * Boxgrinder recipe people can use and modify to suit their need. This is
> useful if you want to create your own virtual machine (kvm, vmware,
> virtualbox) but also want to create images for ec2.
> * kickstart file to build a live fedora image
> * puppet recipes to deploy all these services.
>
>
> Regarding your issue with the instructions on the wiki, this is because
> they have not been updated since Apache Bigtop became a top level project,
> and the location of the artefacts has changed. When reading the
> instructions, please use the following as the base url:
> http://www.apache.org/dist/**bigtop/bigtop-0.5.0/<http://www.apache.org/dist/bigtop/bigtop-0.5.0/>
>
> Please, let me know if you still have some issues.
>
> Thanks,
> Bruno
>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Bruno Mahé <bm...@apache.org>.

On 03/29/2013 01:09 AM, David Parks wrote:
> Hmm, seems intriguing. I’m still not totally clear on bigtop here. It
> seems like they’re creating and maintain basically an installer for Hadoop?
>
> I tried following their docs for Ubuntu, but just get a 404 error on the
> first step, so it makes me wonder how reliable that project is.
>
> https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop
>
> Has anyone actually used bigtop to deploy Hadoop in a production
> environment?
>

Hi David,

You may want to send an email to Apache Bigtop mailing lists with your 
questions, but the goal of Apache Bigtop is three folds:
1/ Provide top notch packages for Apache Hadoop related projects for the 
most popular GNU/Linux distributions
2/ Provide a point of integration and testing for all these projects
3/ Provide means to reliably deploy a complete stack.

Apache Bigtop was donated by Cloudera to the Apache Foundation and is
now used as the base for CDH (Cloudera's distribution).

All the projects supported by Apache Bigtop have packages for 
Debian/Ubuntu/SLES11/Fedora/CentOS.
They also have tests to exercise integration points between all of them
(ex: Hive can use HBase which sits on top of HDFS). And in order to run
these tests, we also have a test framework.
Also before we can test for integration, we also have to ensure they can 
be properly installed/upgraded/removed, with the right users, ulimits, 
rights and so forth. So to that end, we also have a large chunk of the 
tests and testing framework dedicated to testing the packages themselves.

And finally, regarding the deployment, we have the following:
* Boxgrinder recipe people can use and modify to suit their need. This 
is useful if you want to create your own virtual machine (kvm, vmware, 
virtualbox) but also want to create images for ec2.
* kickstart file to build a live fedora image
* puppet recipes to deploy all these services.

Regarding your issue with the instructions on the wiki, this is because 
they have not been updated since Apache Bigtop became a top level 
project, and the location of the artefacts has changed. When reading the 
instructions, please use the following as the base url: 
http://www.apache.org/dist/bigtop/bigtop-0.5.0/

Please, let me know if you still have some issues.

Thanks,
Bruno

Re: Which hadoop installation should I use on ubuntu server?

Posted by Bruno Mahé <bm...@apache.org>.

On 03/29/2013 01:09 AM, David Parks wrote:
> Hmm, seems intriguing. I’m still not totally clear on bigtop here. It
> seems like they’re creating and maintain basically an installer for Hadoop?
>
> I tried following their docs for Ubuntu, but just get a 404 error on the
> first step, so it makes me wonder how reliable that project is.
>
> https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop
>
> Has anyone actually used bigtop to deploy Hadoop in a production
> environment?
>

Hi David,

You may want to send an email to Apache Bigtop mailing lists with your 
questions, but the goal of Apache Bigtop is three folds:
1/ Provide top notch packages for Apache Hadoop related projects for the 
most popular GNU/Linux distributions
2/ Provide a point of integration and testing for all these projects
3/ Provide means to reliably deploy a complete stack.

Apache Bigtop was donated by Cloudera to the Apache Foundation and is
now used as the base for CDH (Cloudera's distribution).

All the projects supported by Apache Bigtop have packages for 
Debian/Ubuntu/SLES11/Fedora/CentOS.
They also have tests to exercise integration points between all of them
(ex: Hive can use HBase which sits on top of HDFS). And in order to run
these tests, we also have a test framework.
Also before we can test for integration, we also have to ensure they can 
be properly installed/upgraded/removed, with the right users, ulimits, 
rights and so forth. So to that end, we also have a large chunk of the 
tests and testing framework dedicated to testing the packages themselves.

And finally, regarding the deployment, we have the following:
* Boxgrinder recipe people can use and modify to suit their need. This 
is useful if you want to create your own virtual machine (kvm, vmware, 
virtualbox) but also want to create images for ec2.
* kickstart file to build a live fedora image
* puppet recipes to deploy all these services.

Regarding your issue with the instructions on the wiki, this is because 
they have not been updated since Apache Bigtop became a top level 
project, and the location of the artefacts has changed. When reading the 
instructions, please use the following as the base url: 
http://www.apache.org/dist/bigtop/bigtop-0.5.0/

Please, let me know if you still have some issues.

Thanks,
Bruno

Re: Which hadoop installation should I use on ubuntu server?

Posted by Bruno Mahé <bm...@apache.org>.

On 03/29/2013 01:09 AM, David Parks wrote:
> Hmm, seems intriguing. I’m still not totally clear on bigtop here. It
> seems like they’re creating and maintain basically an installer for Hadoop?
>
> I tried following their docs for Ubuntu, but just get a 404 error on the
> first step, so it makes me wonder how reliable that project is.
>
> https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop
>
> Has anyone actually used bigtop to deploy Hadoop in a production
> environment?
>

Hi David,

You may want to send an email to Apache Bigtop mailing lists with your 
questions, but the goal of Apache Bigtop is three folds:
1/ Provide top notch packages for Apache Hadoop related projects for the 
most popular GNU/Linux distributions
2/ Provide a point of integration and testing for all these projects
3/ Provide means to reliably deploy a complete stack.

Apache Bigtop was donated by Cloudera to the Apache Foundation and is
now used as the base for CDH (Cloudera's distribution).

All the projects supported by Apache Bigtop have packages for 
Debian/Ubuntu/SLES11/Fedora/CentOS.
They also have tests to exercise integration points between all of them
(ex: Hive can use HBase which sits on top of HDFS). And in order to run
these tests, we also have a test framework.
Also before we can test for integration, we also have to ensure they can 
be properly installed/upgraded/removed, with the right users, ulimits, 
rights and so forth. So to that end, we also have a large chunk of the 
tests and testing framework dedicated to testing the packages themselves.

And finally, regarding the deployment, we have the following:
* Boxgrinder recipe people can use and modify to suit their need. This 
is useful if you want to create your own virtual machine (kvm, vmware, 
virtualbox) but also want to create images for ec2.
* kickstart file to build a live fedora image
* puppet recipes to deploy all these services.

Regarding your issue with the instructions on the wiki, this is because 
they have not been updated since Apache Bigtop became a top level 
project, and the location of the artefacts has changed. When reading the 
instructions, please use the following as the base url: 
http://www.apache.org/dist/bigtop/bigtop-0.5.0/

Please, let me know if you still have some issues.

Thanks,
Bruno

Re: Which hadoop installation should I use on ubuntu server?

Posted by Bruno Mahé <bm...@apache.org>.

On 03/29/2013 01:09 AM, David Parks wrote:
> Hmm, seems intriguing. I’m still not totally clear on bigtop here. It
> seems like they’re creating and maintain basically an installer for Hadoop?
>
> I tried following their docs for Ubuntu, but just get a 404 error on the
> first step, so it makes me wonder how reliable that project is.
>
> https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+distribution+from+Bigtop
>
> Has anyone actually used bigtop to deploy Hadoop in a production
> environment?
>

Hi David,

You may want to send an email to Apache Bigtop mailing lists with your 
questions, but the goal of Apache Bigtop is three folds:
1/ Provide top notch packages for Apache Hadoop related projects for the 
most popular GNU/Linux distributions
2/ Provide a point of integration and testing for all these projects
3/ Provide means to reliably deploy a complete stack.

Apache Bigtop was donated by Cloudera to the Apache Foundation and is
now used as the base for CDH (Cloudera's distribution).

All the projects supported by Apache Bigtop have packages for 
Debian/Ubuntu/SLES11/Fedora/CentOS.
They also have tests to exercise integration points between all of them
(ex: Hive can use HBase which sits on top of HDFS). And in order to run
these tests, we also have a test framework.
Also before we can test for integration, we also have to ensure they can 
be properly installed/upgraded/removed, with the right users, ulimits, 
rights and so forth. So to that end, we also have a large chunk of the 
tests and testing framework dedicated to testing the packages themselves.

And finally, regarding the deployment, we have the following:
* Boxgrinder recipe people can use and modify to suit their need. This 
is useful if you want to create your own virtual machine (kvm, vmware, 
virtualbox) but also want to create images for ec2.
* kickstart file to build a live fedora image
* puppet recipes to deploy all these services.

Regarding your issue with the instructions on the wiki, this is because 
they have not been updated since Apache Bigtop became a top level 
project, and the location of the artefacts has changed. When reading the 
instructions, please use the following as the base url: 
http://www.apache.org/dist/bigtop/bigtop-0.5.0/

Please, let me know if you still have some issues.

Thanks,
Bruno

RE: Which hadoop installation should I use on ubuntu server?

Posted by David Parks <da...@yahoo.com>.

Hmm, seems intriguing. I'm still not totally clear on bigtop here. It seems
like they're creating and maintain basically an installer for Hadoop?

 

I tried following their docs for Ubuntu, but just get a 404 error on the
first step, so it makes me wonder how reliable that project is.

 

https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+dis
tribution+from+Bigtop

 

Has anyone actually used bigtop to deploy Hadoop in a production
environment?

 

 

From: Nitin Pawar [mailto:nitinpawar432@gmail.com] 
Sent: Thursday, March 28, 2013 1:22 PM
To: user@hadoop.apache.org
Subject: Re: Which hadoop installation should I use on ubuntu server?

 

apache bigtop has builds done for ubuntu

 

you can check them at jenkins mentioned on bigtop.apache.org 

 

On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>
wrote:

I'm moving off AWS MapReduce to our own cluster, I'm installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if it'll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

 





 

-- 
Nitin Pawar

Re: Which hadoop installation should I use on ubuntu server?

Posted by Ted Dunning <td...@maprtech.com>.

Also, Canonical just announced that MapR is available in the Partner repos.


On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar <ni...@gmail.com>wrote:

> apache bigtop has builds done for ubuntu
>
> you can check them at jenkins mentioned on bigtop.apache.org
>
>
> On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:
>
>> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
>> Ubuntu Server 12.10.****
>>
>> ** **
>>
>> I see a .deb installer and installed that, but it seems like files are
>> all over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`.
>> And the documentation is a bit harder to follow:****
>>
>> ** **
>>
>> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>>
>> ** **
>>
>> So I just wonder if this installer is the best approach, or if it’ll be
>> easier/better to just install the basic build in /opt/hadoop and perhaps
>> the docs become easier to follow. Thoughts?****
>>
>> ** **
>>
>> Thanks,****
>>
>> Dave****
>>
>> ** **
>>
>
>
>
> --
> Nitin Pawar
>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Ted Dunning <td...@maprtech.com>.

Also, Canonical just announced that MapR is available in the Partner repos.


On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar <ni...@gmail.com>wrote:

> apache bigtop has builds done for ubuntu
>
> you can check them at jenkins mentioned on bigtop.apache.org
>
>
> On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:
>
>> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
>> Ubuntu Server 12.10.****
>>
>> ** **
>>
>> I see a .deb installer and installed that, but it seems like files are
>> all over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`.
>> And the documentation is a bit harder to follow:****
>>
>> ** **
>>
>> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>>
>> ** **
>>
>> So I just wonder if this installer is the best approach, or if it’ll be
>> easier/better to just install the basic build in /opt/hadoop and perhaps
>> the docs become easier to follow. Thoughts?****
>>
>> ** **
>>
>> Thanks,****
>>
>> Dave****
>>
>> ** **
>>
>
>
>
> --
> Nitin Pawar
>

RE: Which hadoop installation should I use on ubuntu server?

Posted by David Parks <da...@yahoo.com>.

Hmm, seems intriguing. I'm still not totally clear on bigtop here. It seems
like they're creating and maintain basically an installer for Hadoop?

 

I tried following their docs for Ubuntu, but just get a 404 error on the
first step, so it makes me wonder how reliable that project is.

 

https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+dis
tribution+from+Bigtop

 

Has anyone actually used bigtop to deploy Hadoop in a production
environment?

 

 

From: Nitin Pawar [mailto:nitinpawar432@gmail.com] 
Sent: Thursday, March 28, 2013 1:22 PM
To: user@hadoop.apache.org
Subject: Re: Which hadoop installation should I use on ubuntu server?

 

apache bigtop has builds done for ubuntu

 

you can check them at jenkins mentioned on bigtop.apache.org 

 

On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>
wrote:

I'm moving off AWS MapReduce to our own cluster, I'm installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if it'll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

 





 

-- 
Nitin Pawar

RE: Which hadoop installation should I use on ubuntu server?

Posted by David Parks <da...@yahoo.com>.

Hmm, seems intriguing. I'm still not totally clear on bigtop here. It seems
like they're creating and maintain basically an installer for Hadoop?

 

I tried following their docs for Ubuntu, but just get a 404 error on the
first step, so it makes me wonder how reliable that project is.

 

https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+dis
tribution+from+Bigtop

 

Has anyone actually used bigtop to deploy Hadoop in a production
environment?

 

 

From: Nitin Pawar [mailto:nitinpawar432@gmail.com] 
Sent: Thursday, March 28, 2013 1:22 PM
To: user@hadoop.apache.org
Subject: Re: Which hadoop installation should I use on ubuntu server?

 

apache bigtop has builds done for ubuntu

 

you can check them at jenkins mentioned on bigtop.apache.org 

 

On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>
wrote:

I'm moving off AWS MapReduce to our own cluster, I'm installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if it'll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

 





 

-- 
Nitin Pawar

RE: Which hadoop installation should I use on ubuntu server?

Posted by David Parks <da...@yahoo.com>.

Hmm, seems intriguing. I'm still not totally clear on bigtop here. It seems
like they're creating and maintain basically an installer for Hadoop?

 

I tried following their docs for Ubuntu, but just get a 404 error on the
first step, so it makes me wonder how reliable that project is.

 

https://cwiki.apache.org/confluence/display/BIGTOP/How+to+install+Hadoop+dis
tribution+from+Bigtop

 

Has anyone actually used bigtop to deploy Hadoop in a production
environment?

 

 

From: Nitin Pawar [mailto:nitinpawar432@gmail.com] 
Sent: Thursday, March 28, 2013 1:22 PM
To: user@hadoop.apache.org
Subject: Re: Which hadoop installation should I use on ubuntu server?

 

apache bigtop has builds done for ubuntu

 

you can check them at jenkins mentioned on bigtop.apache.org 

 

On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>
wrote:

I'm moving off AWS MapReduce to our own cluster, I'm installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if it'll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

 





 

-- 
Nitin Pawar

Re: Which hadoop installation should I use on ubuntu server?

Posted by Ted Dunning <td...@maprtech.com>.

Also, Canonical just announced that MapR is available in the Partner repos.


On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar <ni...@gmail.com>wrote:

> apache bigtop has builds done for ubuntu
>
> you can check them at jenkins mentioned on bigtop.apache.org
>
>
> On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:
>
>> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
>> Ubuntu Server 12.10.****
>>
>> ** **
>>
>> I see a .deb installer and installed that, but it seems like files are
>> all over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`.
>> And the documentation is a bit harder to follow:****
>>
>> ** **
>>
>> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>>
>> ** **
>>
>> So I just wonder if this installer is the best approach, or if it’ll be
>> easier/better to just install the basic build in /opt/hadoop and perhaps
>> the docs become easier to follow. Thoughts?****
>>
>> ** **
>>
>> Thanks,****
>>
>> Dave****
>>
>> ** **
>>
>
>
>
> --
> Nitin Pawar
>

Re: Which hadoop installation should I use on ubuntu server?

Posted by Nitin Pawar <ni...@gmail.com>.

apache bigtop has builds done for ubuntu

you can check them at jenkins mentioned on bigtop.apache.org


On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:

> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
> Ubuntu Server 12.10.****
>
> ** **
>
> I see a .deb installer and installed that, but it seems like files are all
> over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
> the documentation is a bit harder to follow:****
>
> ** **
>
> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>
> ** **
>
> So I just wonder if this installer is the best approach, or if it’ll be
> easier/better to just install the basic build in /opt/hadoop and perhaps
> the docs become easier to follow. Thoughts?****
>
> ** **
>
> Thanks,****
>
> Dave****
>
> ** **
>



-- 
Nitin Pawar

RE: Which hadoop installation should I use on ubuntu server?

Posted by David Parks <da...@yahoo.com>.

Ive never used the Cloudera distributions, but you cant not hear about
them. Is it really much easier to manage the whole platform using clouderas
manager?   50 nodes free is generous enough that Id feel comfortable
committing to them as a platform (and thus the future potential cost), I
think. 

 

My only real experience comes from AWSs environment, which, other than
having a dedicated DFS, and launching jobs via their steps process, they
seem like a pretty straight forward Hadoop configuration.

 

Dave

 

 

From: Håvard Wahl Kongsgård [mailto:haavard.kongsgaard@gmail.com] 
Sent: Friday, March 29, 2013 3:21 PM
To: user
Subject: Re: Which hadoop installation should I use on ubuntu server?

 

I recommend cloudera's CDH4 on ubuntu 12.04 LTS

 

On Thu, Mar 28, 2013 at 7:07 AM, David Parks <da...@yahoo.com> wrote:

Im moving off AWS MapReduce to our own cluster, Im installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if itll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

 





 

-- 
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.dbkeeping.com/

RE: Which hadoop installation should I use on ubuntu server?

Posted by David Parks <da...@yahoo.com>.

Ive never used the Cloudera distributions, but you cant not hear about
them. Is it really much easier to manage the whole platform using clouderas
manager?   50 nodes free is generous enough that Id feel comfortable
committing to them as a platform (and thus the future potential cost), I
think. 

 

My only real experience comes from AWSs environment, which, other than
having a dedicated DFS, and launching jobs via their steps process, they
seem like a pretty straight forward Hadoop configuration.

 

Dave

 

 

From: Håvard Wahl Kongsgård [mailto:haavard.kongsgaard@gmail.com] 
Sent: Friday, March 29, 2013 3:21 PM
To: user
Subject: Re: Which hadoop installation should I use on ubuntu server?

 

I recommend cloudera's CDH4 on ubuntu 12.04 LTS

 

On Thu, Mar 28, 2013 at 7:07 AM, David Parks <da...@yahoo.com> wrote:

Im moving off AWS MapReduce to our own cluster, Im installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if itll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

 





 

-- 
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.dbkeeping.com/

RE: Which hadoop installation should I use on ubuntu server?

Posted by David Parks <da...@yahoo.com>.

Ive never used the Cloudera distributions, but you cant not hear about
them. Is it really much easier to manage the whole platform using clouderas
manager?   50 nodes free is generous enough that Id feel comfortable
committing to them as a platform (and thus the future potential cost), I
think. 

 

My only real experience comes from AWSs environment, which, other than
having a dedicated DFS, and launching jobs via their steps process, they
seem like a pretty straight forward Hadoop configuration.

 

Dave

 

 

From: Håvard Wahl Kongsgård [mailto:haavard.kongsgaard@gmail.com] 
Sent: Friday, March 29, 2013 3:21 PM
To: user
Subject: Re: Which hadoop installation should I use on ubuntu server?

 

I recommend cloudera's CDH4 on ubuntu 12.04 LTS

 

On Thu, Mar 28, 2013 at 7:07 AM, David Parks <da...@yahoo.com> wrote:

Im moving off AWS MapReduce to our own cluster, Im installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if itll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

 





 

-- 
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.dbkeeping.com/

RE: Which hadoop installation should I use on ubuntu server?

Posted by David Parks <da...@yahoo.com>.

Ive never used the Cloudera distributions, but you cant not hear about
them. Is it really much easier to manage the whole platform using clouderas
manager?   50 nodes free is generous enough that Id feel comfortable
committing to them as a platform (and thus the future potential cost), I
think. 

 

My only real experience comes from AWSs environment, which, other than
having a dedicated DFS, and launching jobs via their steps process, they
seem like a pretty straight forward Hadoop configuration.

 

Dave

 

 

From: Håvard Wahl Kongsgård [mailto:haavard.kongsgaard@gmail.com] 
Sent: Friday, March 29, 2013 3:21 PM
To: user
Subject: Re: Which hadoop installation should I use on ubuntu server?

 

I recommend cloudera's CDH4 on ubuntu 12.04 LTS

 

On Thu, Mar 28, 2013 at 7:07 AM, David Parks <da...@yahoo.com> wrote:

Im moving off AWS MapReduce to our own cluster, Im installing Hadoop on
Ubuntu Server 12.10.

 

I see a .deb installer and installed that, but it seems like files are all
over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
the documentation is a bit harder to follow:

 

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

 

So I just wonder if this installer is the best approach, or if itll be
easier/better to just install the basic build in /opt/hadoop and perhaps the
docs become easier to follow. Thoughts?

 

Thanks,

Dave

 





 

-- 
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.dbkeeping.com/

Re: Which hadoop installation should I use on ubuntu server?

Posted by Håvard Wahl Kongsgård <ha...@gmail.com>.

I recommend cloudera's CDH4 on ubuntu 12.04 LTS


On Thu, Mar 28, 2013 at 7:07 AM, David Parks <da...@yahoo.com> wrote:

> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
> Ubuntu Server 12.10.****
>
> ** **
>
> I see a .deb installer and installed that, but it seems like files are all
> over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
> the documentation is a bit harder to follow:****
>
> ** **
>
> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>
> ** **
>
> So I just wonder if this installer is the best approach, or if it’ll be
> easier/better to just install the basic build in /opt/hadoop and perhaps
> the docs become easier to follow. Thoughts?****
>
> ** **
>
> Thanks,****
>
> Dave****
>
> ** **
>



-- 
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.dbkeeping.com/

Re: Which hadoop installation should I use on ubuntu server?

Posted by Nitin Pawar <ni...@gmail.com>.

apache bigtop has builds done for ubuntu

you can check them at jenkins mentioned on bigtop.apache.org


On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:

> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
> Ubuntu Server 12.10.****
>
> ** **
>
> I see a .deb installer and installed that, but it seems like files are all
> over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
> the documentation is a bit harder to follow:****
>
> ** **
>
> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>
> ** **
>
> So I just wonder if this installer is the best approach, or if it’ll be
> easier/better to just install the basic build in /opt/hadoop and perhaps
> the docs become easier to follow. Thoughts?****
>
> ** **
>
> Thanks,****
>
> Dave****
>
> ** **
>



-- 
Nitin Pawar

Re: Which hadoop installation should I use on ubuntu server?

Posted by Nitin Pawar <ni...@gmail.com>.

apache bigtop has builds done for ubuntu

you can check them at jenkins mentioned on bigtop.apache.org


On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:

> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
> Ubuntu Server 12.10.****
>
> ** **
>
> I see a .deb installer and installed that, but it seems like files are all
> over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
> the documentation is a bit harder to follow:****
>
> ** **
>
> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>
> ** **
>
> So I just wonder if this installer is the best approach, or if it’ll be
> easier/better to just install the basic build in /opt/hadoop and perhaps
> the docs become easier to follow. Thoughts?****
>
> ** **
>
> Thanks,****
>
> Dave****
>
> ** **
>



-- 
Nitin Pawar

Re: Which hadoop installation should I use on ubuntu server?

Posted by Håvard Wahl Kongsgård <ha...@gmail.com>.

I recommend cloudera's CDH4 on ubuntu 12.04 LTS


On Thu, Mar 28, 2013 at 7:07 AM, David Parks <da...@yahoo.com> wrote:

> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
> Ubuntu Server 12.10.****
>
> ** **
>
> I see a .deb installer and installed that, but it seems like files are all
> over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
> the documentation is a bit harder to follow:****
>
> ** **
>
> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>
> ** **
>
> So I just wonder if this installer is the best approach, or if it’ll be
> easier/better to just install the basic build in /opt/hadoop and perhaps
> the docs become easier to follow. Thoughts?****
>
> ** **
>
> Thanks,****
>
> Dave****
>
> ** **
>



-- 
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.dbkeeping.com/

Re: Which hadoop installation should I use on ubuntu server?

Posted by Håvard Wahl Kongsgård <ha...@gmail.com>.

I recommend cloudera's CDH4 on ubuntu 12.04 LTS


On Thu, Mar 28, 2013 at 7:07 AM, David Parks <da...@yahoo.com> wrote:

> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
> Ubuntu Server 12.10.****
>
> ** **
>
> I see a .deb installer and installed that, but it seems like files are all
> over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
> the documentation is a bit harder to follow:****
>
> ** **
>
> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>
> ** **
>
> So I just wonder if this installer is the best approach, or if it’ll be
> easier/better to just install the basic build in /opt/hadoop and perhaps
> the docs become easier to follow. Thoughts?****
>
> ** **
>
> Thanks,****
>
> Dave****
>
> ** **
>



-- 
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.dbkeeping.com/

Re: Which hadoop installation should I use on ubuntu server?

Posted by Nitin Pawar <ni...@gmail.com>.

apache bigtop has builds done for ubuntu

you can check them at jenkins mentioned on bigtop.apache.org


On Thu, Mar 28, 2013 at 11:37 AM, David Parks <da...@yahoo.com>wrote:

> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
> Ubuntu Server 12.10.****
>
> ** **
>
> I see a .deb installer and installed that, but it seems like files are all
> over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
> the documentation is a bit harder to follow:****
>
> ** **
>
> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>
> ** **
>
> So I just wonder if this installer is the best approach, or if it’ll be
> easier/better to just install the basic build in /opt/hadoop and perhaps
> the docs become easier to follow. Thoughts?****
>
> ** **
>
> Thanks,****
>
> Dave****
>
> ** **
>



-- 
Nitin Pawar

Re: Which hadoop installation should I use on ubuntu server?

Posted by Håvard Wahl Kongsgård <ha...@gmail.com>.

I recommend cloudera's CDH4 on ubuntu 12.04 LTS


On Thu, Mar 28, 2013 at 7:07 AM, David Parks <da...@yahoo.com> wrote:

> I’m moving off AWS MapReduce to our own cluster, I’m installing Hadoop on
> Ubuntu Server 12.10.****
>
> ** **
>
> I see a .deb installer and installed that, but it seems like files are all
> over the place `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And
> the documentation is a bit harder to follow:****
>
> ** **
>
> http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html****
>
> ** **
>
> So I just wonder if this installer is the best approach, or if it’ll be
> easier/better to just install the basic build in /opt/hadoop and perhaps
> the docs become easier to follow. Thoughts?****
>
> ** **
>
> Thanks,****
>
> Dave****
>
> ** **
>



-- 
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.dbkeeping.com/