Posted to user@mahout.apache.org by Pavan Sudheendra <pa...@gmail.com> on 2014/05/05 11:16:42 UTC

Mahout Clustering works fine on dev, but not on cluster

I'm trying to run canopy clustering followed by k-means clustering.

Here is the code:

    Path canopyCentroids = new Path(clusteringOutputFolder, "canopy-centroids");
    Path clusterOutput   = new Path(clusteringOutputFolder, "clusters");

    CanopyDriver.run(inputVectors, canopyCentroids,
        new TanimotoDistanceMeasure(), 0.95, 1.05, true, 0.0, false);
    KMeansDriver.run(conf, inputVectors,
        new Path(canopyCentroids + "/clusters-0-final"), clusterOutput,
        new TanimotoDistanceMeasure(), 0.01, 20, true, 0, false);


Whenever I run this on my dev machine it runs fine, but when I export it to
the cluster and run it there, I always get this error message:

java.lang.RuntimeException: java.lang.RuntimeException: java.lang.InterruptedException: Cluster Iteration 1 failed processing
attempt_201404230954_13426_r_000001_0: SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
attempt_201404230954_13426_r_000001_0: org.apache.mahout.math.CardinalityException: Required cardinality 370 but got 356
attempt_201404230954_13426_r_000001_0:  at org.apache.mahout.math.AbstractVector.dot(AbstractVector.java:180)
attempt_201404230954_13426_r_000001_0:  at org.apache.mahout.common.distance.TanimotoDistanceMeasure.distance(TanimotoDistanceMeasure.java:49)
attempt_201404230954_13426_r_000001_0:  at org.apache.mahout.clustering.iterator.DistanceMeasureCluster.pdf(DistanceMeasureCluster.java:66)
attempt_201404230954_13426_r_000001_0:  at org.apache.mahout.clustering.iterator.DistanceMeasureCluster.pdf(DistanceMeasureCluster.java:32)
attempt_201404230954_13426_r_000001_0:  at org.apache.mahout.clustering.iterator.AbstractClusteringPolicy.classify(AbstractClusteringPolicy.java:59)
attempt_201404230954_13426_r_000001_0:  at org.apache.mahout.clustering.classify.ClusterClassifier.classify(ClusterClassifier.java:99)
attempt_201404230954_13426_r_000001_0:  at org.apache.mahout.clustering.iterator.CIMapper.map(CIMapper.java:36)
attempt_201404230954_13426_r_000001_0:  at org.apache.mahout.clustering.iterator.CIMapper.map(CIMapper.java:17)
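
For reference, the CardinalityException in the trace above is raised when a dot product is attempted between two vectors of different sizes (here 370 and 356), which suggests that the vectors being compared on the cluster do not all have the same length. The following is only a rough, Mahout-free sketch of that kind of check. The class name CardinalityCheck and both helper methods are hypothetical, plain double[] arrays stand in for Mahout vectors, and the exception type is a stand-in rather than Mahout's own:

```java
import java.util.Arrays;
import java.util.List;

public class CardinalityCheck {

    // Plain-Java stand-in for a dot product with a size check.
    // This is NOT Mahout's implementation, only an illustration of the check.
    public static double dot(double[] a, double[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException(
                "Required cardinality " + a.length + " but got " + b.length);
        }
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    // Returns true when every vector in the list has the same length.
    public static boolean sameCardinality(List<double[]> vectors) {
        if (vectors.isEmpty()) {
            return true;
        }
        int expected = vectors.get(0).length;
        for (double[] v : vectors) {
            if (v.length != expected) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Sizes taken from the error message; the values themselves are dummies.
        double[] centroid = new double[370];
        double[] point = new double[356];

        System.out.println("same cardinality: "
            + sameCardinality(Arrays.asList(centroid, point)));
        try {
            dot(centroid, point);
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

A similar pass over the real input vectors and the canopy centroids on the cluster might show whether they were generated with different dimensions than on the dev machine.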


Please help. I don't know where I'm going wrong here.

-- 
Regards-
Pavan