You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Isabel Drost (Commented) (JIRA)" <ji...@apache.org> on 2012/03/20 22:47:39 UTC

[jira] [Commented] (MAHOUT-716) Implement Boosting

    [ https://issues.apache.org/jira/browse/MAHOUT-716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13233818#comment-13233818 ] 

Isabel Drost commented on MAHOUT-716:
-------------------------------------

After not much activity - took a brief look at the patch. Some comments (to be taken with a grain of salt after I didn't have the cycles to follow the project as closely as I would have liked in the past months):

You mentioned a forked git repo on github - is it still online?

So far looks like a rather isolated change. Would it make sense to integrate it with existing classification APIs e.g. org.apache.mahout.classifier.AbstractVectorClassifier?

Also some more documentation and a usage example for the un-initiated would be great: In addition to links to one or two publications the implementation is based on it's always great to have some information on the strengths and weaknesses of the implemented solution (yes, I know we are doing pretty badly along these lines with other bits and pieces we have - still would be nice to have).
                
> Implement Boosting
> ------------------
>
>                 Key: MAHOUT-716
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-716
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>    Affects Versions: 0.5
>            Reporter: Hector Yee
>            Assignee: Ted Dunning
>            Priority: Minor
>              Labels: features
>         Attachments: MAHOUT-716.patch, MAHOUT-716.patch
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> Implement boosting (grad boost variant) with l1-regularization and induction.
> The gradient part is scalable and parallel and the induction part allows stochastic hypothesis generation for speed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira