You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2010/09/24 21:52:37 UTC
[jira] Commented: (MAHOUT-236) Cluster Evaluation Tools
[ https://issues.apache.org/jira/browse/MAHOUT-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12914620#action_12914620 ]
Hudson commented on MAHOUT-236:
-------------------------------
Integrated in Mahout-Quality #322 (See [https://hudson.apache.org/hudson/job/Mahout-Quality/322/])
MAHOUT-236
- Implemented ClusterEvaluator that uses Mahout In Action code for
inter-cluster density and similar code for intra-cluster density over a set of
representative points, not the entire clustered data set.
- Generalized CDbwDriver etc to RepresentativePointsDriver so any cluster
evaluator tool can use them
- Added cluster pruning to CDbwEvaluator and ClusterEvaluator that removes
clusters which cause numerical instabilities in the evaluation
- Added unit tests. All tests run
> Cluster Evaluation Tools
> ------------------------
>
> Key: MAHOUT-236
> URL: https://issues.apache.org/jira/browse/MAHOUT-236
> Project: Mahout
> Issue Type: New Feature
> Components: Clustering
> Reporter: Grant Ingersoll
> Assignee: Jeff Eastman
> Fix For: 0.5
>
> Attachments: MAHOUT-236.patch, MAHOUT-236.patch, MAHOUT-236.patch, MAHOUT-236.patch, MAHOUT-236.patch
>
>
> Per http://www.lucidimagination.com/search/document/10b562f10288993c/validating_clustering_output#9d3f6a55f4a91cb6, it would be great to have some utilities to help evaluate the effectiveness of clustering.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.