You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Hudson (Commented) (JIRA)" <ji...@apache.org> on 2012/02/23 02:31:49 UTC

[jira] [Commented] (MAHOUT-933) Implement mapreduce version of ClusterIterator

    [ https://issues.apache.org/jira/browse/MAHOUT-933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13214175#comment-13214175 ] 

Hudson commented on MAHOUT-933:
-------------------------------

Integrated in Mahout-Quality #1362 (See [https://builds.apache.org/job/Mahout-Quality/1362/])
    MAHOUT-933: Refactored actual classification out of ClusterClassifier and into ClusteringPolicies. This
allows classifier to be completely generic as to the algorithm and gives policies correct use of e.g. fuzzyK 'm'
Introduced Canopy and MeanShift clustering policies for classification though not used by cluster iterator
Modified serialization of ClusterClassifiers to include ClusteringPolicy
Added ClusterClassifier serialization methods to exploded sequenceFile representation needed for MR
Updated Display examples and unit tests. All run (Revision 1292563)

     Result = FAILURE
jeastman : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1292563
Files : 
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/CIMapper.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/CanopyClusteringPolicy.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/ClusterClassifier.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/ClusterIterator.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/ClusteringPolicy.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/DirichletClusteringPolicy.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/FuzzyKMeansClusteringPolicy.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/KMeansClusteringPolicy.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/MeanShiftClusteringPolicy.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/classify/ClusterClassificationDriver.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/classify/ClusterClassificationMapper.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/fuzzykmeans/FuzzyKMeansClusterer.java
* /mahout/trunk/core/src/test/java/org/apache/mahout/clustering/TestClusterClassifier.java
* /mahout/trunk/examples/src/main/java/org/apache/mahout/clustering/display/DisplayDirichlet.java
* /mahout/trunk/examples/src/main/java/org/apache/mahout/clustering/display/DisplayFuzzyKMeans.java
* /mahout/trunk/examples/src/main/java/org/apache/mahout/clustering/display/DisplayKMeans.java

                
> Implement mapreduce version of ClusterIterator
> ----------------------------------------------
>
>                 Key: MAHOUT-933
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-933
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification, Clustering
>    Affects Versions: 0.6
>            Reporter: Paritosh Ranjan
>            Assignee: Jeff Eastman
>             Fix For: 0.7
>
>
> Right now, ClusterIterator consumes vectors only from in-memory and sequential hdfs. A mapreduce version to consume vectors needs to be implemented.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira