You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Viktor Gal (Commented) (JIRA)" <ji...@apache.org> on 2012/02/05 22:48:00 UTC

[jira] [Commented] (MAHOUT-968) Classifier based on restricted boltzmann machines

    [ https://issues.apache.org/jira/browse/MAHOUT-968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13200920#comment-13200920 ] 

Viktor Gal commented on MAHOUT-968:
-----------------------------------

any patch available for test? as i'd gladly give it a go for testing...thnx
                
> Classifier based on restricted boltzmann machines
> -------------------------------------------------
>
>                 Key: MAHOUT-968
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-968
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>            Reporter: Dirk Weißenborn
>              Labels: classification, mnist
>   Original Estimate: 336h
>  Remaining Estimate: 336h
>
> This is a proposal for a new classifier based on restricted boltzmann machines. The development of this feature follows the paper on "Deep Boltzmann Machines" (DBM) [1] from 2009. The proposed model (DBM) got an error rate of 0.95% on the mnist dataset [2], which is really good. Main parts of the implementation should also be applicable to other scenarios than classification where restricted boltzmann machines are used (ref. MAHOUT-375).
> I am working on this feature right now, and the results are promising. The only problem with the training algorithm is, that it is still mostly sequential (if training batches are small, what they should be), which makes Map/Reduce until now, not really beneficial. However, since the algorithm itself is fast (for a training algorithm), training can be done on a single machine in managable time.
> Testing of the algorithm is currently done on the mnist dataset itself to reproduce results of [1]. As soon as results indicate, that everything is working fine, I will upload the patch.
> [1] http://www.cs.toronto.edu/~hinton/absps/dbm.pdf
> [2] http://yann.lecun.com/exdb/mnist/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

Re: [jira] [Commented] (MAHOUT-968) Classifier based on restricted boltzmann machines

Posted by Dirk Weissenborn <di...@googlemail.com>.
on it! i am running my own test right now on the mnist testset, and i think
in the next few days, i could upload the patch. but i still got a little
testing going on. I ll let you know when it is ready!

2012/2/5 Viktor Gal (Commented) (JIRA) <ji...@apache.org>

>
>    [
> https://issues.apache.org/jira/browse/MAHOUT-968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13200920#comment-13200920]
>
> Viktor Gal commented on MAHOUT-968:
> -----------------------------------
>
> any patch available for test? as i'd gladly give it a go for testing...thnx
>
> > Classifier based on restricted boltzmann machines
> > -------------------------------------------------
> >
> >                 Key: MAHOUT-968
> >                 URL: https://issues.apache.org/jira/browse/MAHOUT-968
> >             Project: Mahout
> >          Issue Type: New Feature
> >          Components: Classification
> >            Reporter: Dirk Weißenborn
> >              Labels: classification, mnist
> >   Original Estimate: 336h
> >  Remaining Estimate: 336h
> >
> > This is a proposal for a new classifier based on restricted boltzmann
> machines. The development of this feature follows the paper on "Deep
> Boltzmann Machines" (DBM) [1] from 2009. The proposed model (DBM) got an
> error rate of 0.95% on the mnist dataset [2], which is really good. Main
> parts of the implementation should also be applicable to other scenarios
> than classification where restricted boltzmann machines are used (ref.
> MAHOUT-375).
> > I am working on this feature right now, and the results are promising.
> The only problem with the training algorithm is, that it is still mostly
> sequential (if training batches are small, what they should be), which
> makes Map/Reduce until now, not really beneficial. However, since the
> algorithm itself is fast (for a training algorithm), training can be done
> on a single machine in managable time.
> > Testing of the algorithm is currently done on the mnist dataset itself
> to reproduce results of [1]. As soon as results indicate, that everything
> is working fine, I will upload the patch.
> > [1] http://www.cs.toronto.edu/~hinton/absps/dbm.pdf
> > [2] http://yann.lecun.com/exdb/mnist/
>
> --
> This message is automatically generated by JIRA.
> If you think it was sent incorrectly, please contact your JIRA
> administrators:
> https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
> For more information on JIRA, see: http://www.atlassian.com/software/jira
>
>
>