You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Ted Dunning (JIRA)" <ji...@apache.org> on 2008/12/26 22:59:44 UTC

[jira] Issue Comment Edited: (MAHOUT-30) dirichlet process implementation

    [ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12659273#action_12659273 ] 

tdunning edited comment on MAHOUT-30 at 12/26/08 1:57 PM:
-------------------------------------------------------------

One good reference is this relatively dense article by McCullagh and Yang:

http://ba.stat.cmu.edu/journal/2008/vol03/issue01/yang.pdf

There is also a more approachable example in Chris Bishop's book on Machine Learning.  See http://research.microsoft.com/en-us/um/people/cmbishop/PRML/index.htm.  I think that chapter 9 is where the example of clustering using a mixture model is found.

The Neal and Blei references from the McCullagh and Yang paper are also good.  Zoubin Gharamani has some very nice tutorials out which describe why non-parametric Bayesian approaches to problems are very cool.  One is at http://learning.eng.cam.ac.uk/zoubin/talks/uai05tutorial-b.pdf but here are video versions about as well.

      was (Author: tdunning):
    
Some references include a relatively dense article by McCullagh and Yang:

http://ba.stat.cmu.edu/journal/2008/vol03/issue01/yang.pdf

There is also a more approachable example in Chris Bishop's book on Machine Learning.  See http://research.microsoft.com/en-us/um/people/cmbishop/PRML/index.htm.  I think that chapter 9 is where the example of clustering using a mixture model is found.

The Neal and Blei references from the McCullagh and Yang paper are also good.  Zoubin Gharamani has some very nice tutorials out which describe why non-parametric Bayesian approaches to problems are very cool.  One is at http://learning.eng.cam.ac.uk/zoubin/talks/uai05tutorial-b.pdf but here are video versions about as well.
  
> dirichlet process implementation
> --------------------------------
>
>                 Key: MAHOUT-30
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-30
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Clustering
>            Reporter: Isabel Drost
>            Assignee: Jeff Eastman
>         Attachments: jeastman.vcf, MAHOUT-30.patch, MAHOUT-30b.patch, MAHOUT-30c.patch
>
>
> Copied over from original issue:
> > Further extension can also be made by assuming an infinite mixture model. The implementation is only slightly more difficult and the result is a (nearly)
> > non-parametric clustering algorithm.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.