You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Grant Ingersoll (JIRA)" <ji...@apache.org> on 2008/08/15 22:53:44 UTC

[jira] Commented: (MAHOUT-74) Fuzzy K-Means clustering

    [ https://issues.apache.org/jira/browse/MAHOUT-74?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12623018#action_12623018 ] 

Grant Ingersoll commented on MAHOUT-74:
---------------------------------------

Couple of questions:

1. What's the urlCount for on SoftCluster?

2.  Shouldn't SoftCluster.m be non-final (and configurable.)

3.  It seems like there should be an opportunity for more inheritance/overlap, etc. w/ the K-Means clustering, but I'd have to think about it a bit more.  

The wikipedia article implies that m == 1 is "similar" to KMeans, is it the case that we could make KMeans just be a special case of fuzzy k means through the appropriate choosing of parameters?

Otherwise, the tests pass and it looks to be in pretty good shape.  Would be cool to have an example added, but not required for this patch to go in.



> Fuzzy K-Means clustering
> ------------------------
>
>                 Key: MAHOUT-74
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-74
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Clustering
>            Reporter: Pallavi Palleti
>            Assignee: Grant Ingersoll
>         Attachments: mahout-74.patch, mahout-74.patch
>
>
> Fuzzy KMeans clustering algorithm is an extension to traditional K Means clustering algorithm and performs soft clustering.
> More details about fuzzy k-means can be found here :http://en.wikipedia.org/wiki/Data_clustering#Fuzzy_c-means_clustering
> I have implemented fuzzy K-Means prototype and tests in org.apache.mahout.clustering.fuzzykmeans

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.