You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Robin Anil <ro...@gmail.com> on 2010/10/25 08:27:05 UTC

Re: Mahout dependencies on windows

Hadoop is not supported on windows, and Mahout is written completely on top
of Hadoop libraries. So we can't help you there. Maybe someone on this list
may have experience with hacking Mahout to work on windows


On Mon, Oct 25, 2010 at 11:34 AM, Divya <di...@k2associates.com.sg> wrote:

> Hi,
>
> Is it must to install cygwin and Hadoop to work with Mahout.
>
> Cant I install/work with mahout standalone on windows.
>
>
>
>
>
> Regards,
>
> Divya
>
>

Re: Mahout dependencies on windows

Posted by Jeff Eastman <jd...@windwardsolutions.com>.
Building from scratch is now even easier than on our wiki, as Cloudera 
CHD3 is now available containing Hadoop 0.22.2 
(http://www.cloudera.com/downloads/). Once you have downloaded and 
started according to their instructions, jump to #7 in 
https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Amazon+EC2 
or just:

# svn co http://svn.apache.org/repos/asf/mahout/trunk mahout
# cd mahout
- or -
# tar zxf mahout-distribution-0.4-src.tar.gz
# cd mahout-distribution-0.4
- then -
# mvn clean install

I've tested most of the Mahout examples on CHD3. I will also update the 
wiki once our 0.4 bits are released

On 10/26/10 4:57 AM, Isabel Drost wrote:
> On Mon, 25 Oct 2010 22:54:42 +0100
> Steven Bourke<sb...@gmail.com>  wrote:
>
>> Ted - Has mahout got an image up on EC2 that anyone can use or do we
>> have to build from scratch?
> None that I'm aware of, however building from scratch should be fairly
> easy:
>
> https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Amazon+EC2
> https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce
>
> Isabel
>


Re: Mahout dependencies on windows

Posted by Ted Dunning <te...@gmail.com>.
Sure.

After the release is done.

On Wed, Oct 27, 2010 at 4:06 AM, Grant Ingersoll <gs...@apache.org>wrote:

> Ted,
>
> Would you consider checking into Utils said script?  Saves the rest of us
> from replicating your work.
>
> -Grant
>
> On Oct 26, 2010, at 12:20 PM, Ted Dunning wrote:
>
> > Since the thing I usually want to run is the stuff I checked in most
> > recently, I generally put a startup script into the user data
> > for a standard alestic AMI.  This script does an svn checkout and build.
> >
> > For production sorts of things, I would put a tar file into s3 and simply
> > untar it at boot time.
> >
> > Building an AMI is vastly easier than it used to be, but I still prefer
> to
> > use a startup script that rains down my non-standard things
> > on a completely standard AMI.  This allows me to version control what is
> > special for my installations and allows me to change
> > underlying kernel independently of changes in my system special
> > configuration.  I hate coupling things like that if I can avoid it.
> >
> > On Tue, Oct 26, 2010 at 4:57 AM, Isabel Drost <is...@apache.org> wrote:
> >
> >> On Mon, 25 Oct 2010 22:54:42 +0100
> >> Steven Bourke <sb...@gmail.com> wrote:
> >>
> >>> Ted - Has mahout got an image up on EC2 that anyone can use or do we
> >>> have to build from scratch?
> >>
> >> None that I'm aware of, however building from scratch should be fairly
> >> easy:
> >>
> >> https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Amazon+EC2
> >>
> >>
> https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce
> >>
> >> Isabel
> >>
>
>

Re: Mahout dependencies on windows

Posted by Grant Ingersoll <gs...@apache.org>.
Ted,

Would you consider checking into Utils said script?  Saves the rest of us from replicating your work.

-Grant

On Oct 26, 2010, at 12:20 PM, Ted Dunning wrote:

> Since the thing I usually want to run is the stuff I checked in most
> recently, I generally put a startup script into the user data
> for a standard alestic AMI.  This script does an svn checkout and build.
> 
> For production sorts of things, I would put a tar file into s3 and simply
> untar it at boot time.
> 
> Building an AMI is vastly easier than it used to be, but I still prefer to
> use a startup script that rains down my non-standard things
> on a completely standard AMI.  This allows me to version control what is
> special for my installations and allows me to change
> underlying kernel independently of changes in my system special
> configuration.  I hate coupling things like that if I can avoid it.
> 
> On Tue, Oct 26, 2010 at 4:57 AM, Isabel Drost <is...@apache.org> wrote:
> 
>> On Mon, 25 Oct 2010 22:54:42 +0100
>> Steven Bourke <sb...@gmail.com> wrote:
>> 
>>> Ted - Has mahout got an image up on EC2 that anyone can use or do we
>>> have to build from scratch?
>> 
>> None that I'm aware of, however building from scratch should be fairly
>> easy:
>> 
>> https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Amazon+EC2
>> 
>> https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce
>> 
>> Isabel
>> 


Re: Mahout dependencies on windows

Posted by Ted Dunning <te...@gmail.com>.
Since the thing I usually want to run is the stuff I checked in most
recently, I generally put a startup script into the user data
for a standard alestic AMI.  This script does an svn checkout and build.

For production sorts of things, I would put a tar file into s3 and simply
untar it at boot time.

Building an AMI is vastly easier than it used to be, but I still prefer to
use a startup script that rains down my non-standard things
on a completely standard AMI.  This allows me to version control what is
special for my installations and allows me to change
underlying kernel independently of changes in my system special
configuration.  I hate coupling things like that if I can avoid it.

On Tue, Oct 26, 2010 at 4:57 AM, Isabel Drost <is...@apache.org> wrote:

> On Mon, 25 Oct 2010 22:54:42 +0100
> Steven Bourke <sb...@gmail.com> wrote:
>
> > Ted - Has mahout got an image up on EC2 that anyone can use or do we
> > have to build from scratch?
>
> None that I'm aware of, however building from scratch should be fairly
> easy:
>
> https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Amazon+EC2
>
> https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce
>
> Isabel
>

Re: Mahout dependencies on windows

Posted by Isabel Drost <is...@apache.org>.
On Mon, 25 Oct 2010 22:54:42 +0100
Steven Bourke <sb...@gmail.com> wrote:

> Ted - Has mahout got an image up on EC2 that anyone can use or do we
> have to build from scratch?

None that I'm aware of, however building from scratch should be fairly
easy:

https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Amazon+EC2
https://cwiki.apache.org/confluence/display/MAHOUT/Mahout+on+Elastic+MapReduce

Isabel

Re: Mahout dependencies on windows

Posted by Steven Bourke <sb...@gmail.com>.
Ted - Has mahout got an image up on EC2 that anyone can use or do we have to
build from scratch?

On Mon, Oct 25, 2010 at 10:45 PM, Ted Dunning <te...@gmail.com> wrote:

> In my case, when I need to do some real computation, I just bit 25 cents an
> hour on a big EC2 instance.
>
> A few dollars, at most, later I have a big model built and my laptop is
> free
> the whole time.
>
> On Mon, Oct 25, 2010 at 2:14 PM, Lance Norskog <go...@gmail.com> wrote:
>
> > I also turn off paging in windows- if there's not enough ram, a
> > program fails. Windows does not go into molasses mode. This really
> > helps doing batch computes.
> >
>

Re: Mahout dependencies on windows

Posted by Ted Dunning <te...@gmail.com>.
In my case, when I need to do some real computation, I just bit 25 cents an
hour on a big EC2 instance.

A few dollars, at most, later I have a big model built and my laptop is free
the whole time.

On Mon, Oct 25, 2010 at 2:14 PM, Lance Norskog <go...@gmail.com> wrote:

> I also turn off paging in windows- if there's not enough ram, a
> program fails. Windows does not go into molasses mode. This really
> helps doing batch computes.
>

Re: Mahout dependencies on windows

Posted by Lance Norskog <go...@gmail.com>.
You need 4G. I use 32-bit java for space reasons, it helps some.

I also turn off paging in windows- if there's not enough ram, a
program fails. Windows does not go into molasses mode. This really
helps doing batch computes.

On Mon, Oct 25, 2010 at 2:12 PM, Lance Norskog <go...@gmail.com> wrote:
> Yes, you can run hadoop and mahout on windows. You have to install
> cywgin. You have to include /cygwin/bin in your PATH when running
> hadoop apps.
>
> I run hadoop and mahout out of eclipse on windows.
>
> On Sun, Oct 24, 2010 at 11:27 PM, Robin Anil <ro...@gmail.com> wrote:
>> Hadoop is not supported on windows, and Mahout is written completely on top
>> of Hadoop libraries. So we can't help you there. Maybe someone on this list
>> may have experience with hacking Mahout to work on windows
>>
>>
>> On Mon, Oct 25, 2010 at 11:34 AM, Divya <di...@k2associates.com.sg> wrote:
>>
>>> Hi,
>>>
>>> Is it must to install cygwin and Hadoop to work with Mahout.
>>>
>>> Cant I install/work with mahout standalone on windows.
>>>
>>>
>>>
>>>
>>>
>>> Regards,
>>>
>>> Divya
>>>
>>>
>>
>
>
>
> --
> Lance Norskog
> goksron@gmail.com
>



-- 
Lance Norskog
goksron@gmail.com

Re: Mahout dependencies on windows

Posted by Lance Norskog <go...@gmail.com>.
More on full Hadoop setup on Windows:

http://alans.se/blog/2010/hadoop-hbase-cygwin-windows-7-x64/

I only run the base hadoop stuff- hdfs/hbase etc. just sounds like
tedious 'put your card deck in the slot and the operator will run it
someday'.

On Mon, Oct 25, 2010 at 2:12 PM, Lance Norskog <go...@gmail.com> wrote:
> Yes, you can run hadoop and mahout on windows. You have to install
> cywgin. You have to include /cygwin/bin in your PATH when running
> hadoop apps.
>
> I run hadoop and mahout out of eclipse on windows.
>
> On Sun, Oct 24, 2010 at 11:27 PM, Robin Anil <ro...@gmail.com> wrote:
>> Hadoop is not supported on windows, and Mahout is written completely on top
>> of Hadoop libraries. So we can't help you there. Maybe someone on this list
>> may have experience with hacking Mahout to work on windows
>>
>>
>> On Mon, Oct 25, 2010 at 11:34 AM, Divya <di...@k2associates.com.sg> wrote:
>>
>>> Hi,
>>>
>>> Is it must to install cygwin and Hadoop to work with Mahout.
>>>
>>> Cant I install/work with mahout standalone on windows.
>>>
>>>
>>>
>>>
>>>
>>> Regards,
>>>
>>> Divya
>>>
>>>
>>
>
>
>
> --
> Lance Norskog
> goksron@gmail.com
>



-- 
Lance Norskog
goksron@gmail.com

Re: Mahout dependencies on windows

Posted by Lance Norskog <go...@gmail.com>.
Yes, you can run hadoop and mahout on windows. You have to install
cywgin. You have to include /cygwin/bin in your PATH when running
hadoop apps.

I run hadoop and mahout out of eclipse on windows.

On Sun, Oct 24, 2010 at 11:27 PM, Robin Anil <ro...@gmail.com> wrote:
> Hadoop is not supported on windows, and Mahout is written completely on top
> of Hadoop libraries. So we can't help you there. Maybe someone on this list
> may have experience with hacking Mahout to work on windows
>
>
> On Mon, Oct 25, 2010 at 11:34 AM, Divya <di...@k2associates.com.sg> wrote:
>
>> Hi,
>>
>> Is it must to install cygwin and Hadoop to work with Mahout.
>>
>> Cant I install/work with mahout standalone on windows.
>>
>>
>>
>>
>>
>> Regards,
>>
>> Divya
>>
>>
>



-- 
Lance Norskog
goksron@gmail.com