You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2010/09/24 21:52:37 UTC

[jira] Commented: (MAHOUT-236) Cluster Evaluation Tools

    [ https://issues.apache.org/jira/browse/MAHOUT-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12914620#action_12914620 ] 

Hudson commented on MAHOUT-236:
-------------------------------

Integrated in Mahout-Quality #322 (See [https://hudson.apache.org/hudson/job/Mahout-Quality/322/])
    MAHOUT-236 
- Implemented ClusterEvaluator that uses Mahout In Action code for
inter-cluster density and similar code for intra-cluster density over a set of 
representative points, not the entire clustered data set.
- Generalized CDbwDriver etc to RepresentativePointsDriver so any cluster
evaluator tool can use them
- Added cluster pruning to CDbwEvaluator and ClusterEvaluator that removes
clusters which cause numerical instabilities in the evaluation
- Added unit tests. All tests run


> Cluster Evaluation Tools
> ------------------------
>
>                 Key: MAHOUT-236
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-236
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Clustering
>            Reporter: Grant Ingersoll
>            Assignee: Jeff Eastman
>             Fix For: 0.5
>
>         Attachments: MAHOUT-236.patch, MAHOUT-236.patch, MAHOUT-236.patch, MAHOUT-236.patch, MAHOUT-236.patch
>
>
> Per http://www.lucidimagination.com/search/document/10b562f10288993c/validating_clustering_output#9d3f6a55f4a91cb6, it would be great to have some utilities to help evaluate the effectiveness of clustering.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.