You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Deneche A. Hakim (JIRA)" <ji...@apache.org> on 2009/09/06 10:01:03 UTC

[jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

    [ https://issues.apache.org/jira/browse/MAHOUT-145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12751842#action_12751842 ] 

Deneche A. Hakim commented on MAHOUT-145:
-----------------------------------------

bq.    *  TODO: test the code on a Hadoop 0.20.0 cluster (EC2)

Looks like I'll have to wait till Hadoop 0.20.1 to be able to test on EC2...after creating my own API (with a lot of pain, being a noob), I stumbled upon the following bug [HADOOP-5921|http://issues.apache.org/jira/browse/HADOOP-5921]



> PartialData mapreduce Random Forests
> ------------------------------------
>
>                 Key: MAHOUT-145
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-145
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>            Reporter: Deneche A. Hakim
>            Priority: Minor
>         Attachments: partial_August_10.patch, partial_August_13.patch, partial_August_15.patch, partial_August_17.patch, partial_August_19.patch, partial_August_2.patch, partial_August_24.patch, partial_August_27.patch, partial_August_31.patch, partial_August_9.patch
>
>
> This implementation is based on a suggestion by Ted:
> "modify the original algorithm to build multiple trees for different portions of the data. That loses some of the solidity of the original method, but could actually do better if the splits exposed non-stationary behavior."

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Re: [jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

Posted by deneche abdelhakim <a_...@yahoo.fr>.
I'll try...may take some time but I 'll surely learn a lot (will also need a refill on my pain killers)

--- En date de : Dim 6.9.09, Ted Dunning <te...@gmail.com> a écrit :

> De: Ted Dunning <te...@gmail.com>
> Objet: Re: [jira] Commented: (MAHOUT-145) PartialData mapreduce Random  Forests
> À: mahout-dev@lucene.apache.org
> Date: Dimanche 6 Septembre 2009, 17h49
> That fix has been created.  Can
> you just use trunk?
> 
> What about using the Yahoo 0.20 distribution?  (
> http://developer.yahoo.com/hadoop/distribution/ )
> 
> On Sun, Sep 6, 2009 at 1:01 AM, Deneche A. Hakim (JIRA)
> <ji...@apache.org>wrote:
> 
> >
> > bq.    *  TODO: test the code on a
> Hadoop 0.20.0 cluster (EC2)
> >
> > Looks like I'll have to wait till Hadoop 0.20.1 to be
> able to test on
> > EC2...after creating my own API (with a lot of pain,
> being a noob), I
> > stumbled upon the following bug [HADOOP-5921|
> > http://issues.apache.org/jira/browse/HADOOP-5921]
> >
> >
> 


      


Re: [jira] Commented: (MAHOUT-145) PartialData mapreduce Random Forests

Posted by Ted Dunning <te...@gmail.com>.
That fix has been created.  Can you just use trunk?

What about using the Yahoo 0.20 distribution?  (
http://developer.yahoo.com/hadoop/distribution/ )

On Sun, Sep 6, 2009 at 1:01 AM, Deneche A. Hakim (JIRA) <ji...@apache.org>wrote:

>
> bq.    *  TODO: test the code on a Hadoop 0.20.0 cluster (EC2)
>
> Looks like I'll have to wait till Hadoop 0.20.1 to be able to test on
> EC2...after creating my own API (with a lot of pain, being a noob), I
> stumbled upon the following bug [HADOOP-5921|
> http://issues.apache.org/jira/browse/HADOOP-5921]
>
>