You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Sameer Tilak <ss...@live.com> on 2013/12/20 23:33:30 UTC

clusterdump

Hi All,
I was able to do the clustering and need some help with viewing the result. I get the following problem.

./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -d /scratch/dummyvectorfinalclusters
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Warning: $HADOOP_HOME is deprecated.

Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf
MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
Warning: $HADOOP_HOME is deprecated.

13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver
Unknown program 'clusterdump' chosen.
Valid program names are:
  baumwelch: : Baum-Welch algorithm for unsupervised HMM training
  canopy: : Canopy clustering
  cleansvd: : Cleanup and verification of SVD output
  clusterpp: : Groups Clustering Output In Clusters
  cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx)
  cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally.
  evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes
  fkmeans: : Fuzzy K-means clustering
  hmmpredict: : Generate random sequence of observations by given HMM
  itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
  kmeans: : K-means clustering
  matrixmult: : Take the product of two matrices
  parallelALS: : ALS-WR factorization of a rating matrix
  recommendfactorized: : Compute recommendations using the factorization of a rating matrix
  recommenditembased: : Compute recommendations using item-based collaborative filtering
  rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
  seq2encoded: : Encoded Sparse Vector generation from Text sequence files
  seq2sparse: : Sparse Vector generation from Text sequence files
  spectralkmeans: : Spectral k-means clustering
  splitDataset: : split a rating dataset into training and probe parts
  ssvd: : Stochastic SVD
  streamingkmeans: : Streaming k-means clustering
  svd: : Lanczos Singular Value Decomposition
  testnb: : Test the Vector-based Bayes classifier
  trainnb: : Train the Vector-based Bayes classifier
  transpose: : Take the transpose of a matrix
  vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
  viterbi: : Viterbi decoding of hidden states from given output states sequence 		 	   		  

RE: clusterdump

Posted by Sameer Tilak <ss...@live.com>.
Hi All,

My HADOOP_CLASSPATH was interfering somehow. Things seem to work fine now.

-bash-4.1$ export HADOOP_CLASSPATH=""

./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final --pointsDir /scratch/clusterdump
MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
Warning: $HADOOP_HOME is deprecated.

13/12/20 15:06:15 INFO common.AbstractJob: Command line arguments: {--dictionaryType=[text], --distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure], --endPhase=[2147483647], --input=[/scratch/dummyvectoroutput/clusters-*-final], --outputFormat=[TEXT], --pointsDir=[/scratch/clusterdump], --startPhase=[0], --tempDir=[temp]}
CL-92{n=10 c=[343.032, 272.783, 78.239, 4.934, 54.654] r=[72.995, 74.388, 75.692, 14.803, 80.172]}
CL-7{n=34 c=[61.475, 67.234, 94.989, 75.609, 267.051] r=[80.386, 84.565, 124.621, 86.960, 90.146]}
CL-98{n=30 c=[28.038, 81.483, 145.317, 269.980, 52.420] r=[43.357, 114.179, 136.547, 119.696, 84.281]}
CL-3{n=8 c=[339.604, 28.429, 124.278, 61.143, 84.997] r=[73.463, 44.537, 128.509, 40.645, 100.324]}
VL-46{n=18 c=[58.082, 299.551, 79.124, 65.438, 39.663] r=[61.926, 96.523, 91.026, 91.622, 66.675]}
13/12/20 15:06:16 INFO clustering.ClusterDumper: Wrote 5 clusters
13/12/20 15:06:16 INFO driver.MahoutDriver: Program took 841 ms (Minutes: 0.014016666666666667)

> Date: Fri, 20 Dec 2013 15:02:13 -0800
> From: suneel_marthi@yahoo.com
> Subject: Re: clusterdump
> To: user@mahout.apache.org
> 
> I would investigate all of those 'Unable to add .....' messages first. Checkout the latest code and run a clean build.
> 
> 
> 
> 
> 
> On Friday, December 20, 2013 5:58 PM, Sameer Tilak <ss...@live.com> wrote:
>  
> Suneel:
> Yes, I am working off of trunk. I saw that example. In my case the data is numeric -- I assume that means no need for dictionary etc . I am not sure what is going on, but I still get the following errors:
> 
> ./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -o /scratch/clusterdump
> MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
> Warning: $HADOOP_HOME is deprecated.
> 
> Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf
> MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
> Warning: $HADOOP_HOME is deprecated.
> 
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob
> 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver
> Unknown program 'clusterdump' chosen.
> Valid program names are:
>   baumwelch: : Baum-Welch algorithm for unsupervised HMM training
>   canopy: : Canopy clustering
>   cleansvd: : Cleanup and verification of SVD output
>   clusterpp: : Groups Clustering Output In Clusters
>   cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx)
>   cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally.
>   evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes
>   fkmeans: : Fuzzy K-means clustering
>   hmmpredict: : Generate random sequence of observations by given HMM
>   itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
>   kmeans: : K-means clustering
>   matrixmult: : Take the product of two matrices
>   parallelALS: : ALS-WR factorization of a rating matrix
>   recommendfactorized: : Compute recommendations using the factorization of a rating matrix
>   recommenditembased: : Compute recommendations using item-based collaborative filtering
>   rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
>   seq2encoded: : Encoded Sparse Vector generation from Text sequence files
>   seq2sparse: : Sparse Vector generation from Text sequence files
>   spectralkmeans: : Spectral k-means clustering
>   splitDataset: : split a rating dataset into training and probe parts
>   ssvd: : Stochastic SVD
>   streamingkmeans: : Streaming k-means clustering
>   svd: : Lanczos Singular Value Decomposition
>   testnb: : Test the Vector-based Bayes classifier
>   trainnb: : Train the Vector-based Bayes classifier
>   transpose: : Take the transpose of a matrix
>   vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
>   viterbi: : Viterbi decoding of hidden states from given output states sequence
> 
> 
> > Date: Fri, 20 Dec 2013 14:42:33 -0800
> > From: suneel_marthi@yahoo.com
> > Subject: Re: clusterdump
> > To: user@mahout.apache.org
> > 
> > Are you working off of trunk? 'clusterdump' is being used in examples/bin/cluster-reuters.sh.
> > 
> > 
> > 
> > 
> > 
> > On Friday, December 20, 2013 5:33 PM, Sameer Tilak <ss...@live.com> wrote:
> >  
> > Hi All,
> > I was able to do the clustering and need some help with viewing the result. I get the following problem.
> > 
> > ./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -d /scratch/dummyvectorfinalclusters
> > MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
> > Warning: $HADOOP_HOME is deprecated.
> > 
> > Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf
> > MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
> > Warning: $HADOOP_HOME is deprecated.
> > 
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob
> > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver
> > Unknown program 'clusterdump' chosen.
> > Valid program names are:
> >   baumwelch: : Baum-Welch algorithm for unsupervised HMM training
> >   canopy: : Canopy clustering
> >   cleansvd: : Cleanup and verification of SVD output
> >   clusterpp: : Groups Clustering Output In Clusters
> >   cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx)
> >   cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally.
> >   evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes
> >   fkmeans: : Fuzzy K-means clustering
> >   hmmpredict: : Generate random sequence of observations by given HMM
> >   itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
> >   kmeans: : K-means clustering
> >   matrixmult: : Take the product of two matrices
> >   parallelALS: : ALS-WR factorization of a rating matrix
> >   recommendfactorized: : Compute recommendations using the factorization of a rating matrix
> >   recommenditembased: : Compute recommendations using item-based collaborative filtering
> >   rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
> >   seq2encoded: : Encoded Sparse Vector generation from Text sequence files
> >   seq2sparse: : Sparse Vector generation from Text sequence files
> >   spectralkmeans: : Spectral k-means clustering
> >   splitDataset: : split a rating dataset into training and probe parts
> >   ssvd: : Stochastic SVD
> >   streamingkmeans: : Streaming k-means clustering
> >   svd: : Lanczos Singular Value Decomposition
> >   testnb: : Test the Vector-based Bayes classifier
> >   trainnb: : Train the Vector-based Bayes classifier
> >   transpose: : Take the transpose of a matrix
> >   vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
> >   viterbi: : Viterbi decoding of hidden states from given output states sequence                          
 		 	   		  

Re: clusterdump

Posted by Suneel Marthi <su...@yahoo.com>.
I would investigate all of those 'Unable to add .....' messages first. Checkout the latest code and run a clean build.





On Friday, December 20, 2013 5:58 PM, Sameer Tilak <ss...@live.com> wrote:
 
Suneel:
Yes, I am working off of trunk. I saw that example. In my case the data is numeric -- I assume that means no need for dictionary etc . I am not sure what is going on, but I still get the following errors:

./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -o /scratch/clusterdump
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Warning: $HADOOP_HOME is deprecated.

Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf
MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
Warning: $HADOOP_HOME is deprecated.

13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver
Unknown program 'clusterdump' chosen.
Valid program names are:
  baumwelch: : Baum-Welch algorithm for unsupervised HMM training
  canopy: : Canopy clustering
  cleansvd: : Cleanup and verification of SVD output
  clusterpp: : Groups Clustering Output In Clusters
  cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx)
  cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally.
  evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes
  fkmeans: : Fuzzy K-means clustering
  hmmpredict: : Generate random sequence of observations by given HMM
  itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
  kmeans: : K-means clustering
  matrixmult: : Take the product of two matrices
  parallelALS: : ALS-WR factorization of a rating matrix
  recommendfactorized: : Compute recommendations using the factorization of a rating matrix
  recommenditembased: : Compute recommendations using item-based collaborative filtering
  rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
  seq2encoded: : Encoded Sparse Vector generation from Text sequence files
  seq2sparse: : Sparse Vector generation from Text sequence files
  spectralkmeans: : Spectral k-means clustering
  splitDataset: : split a rating dataset into training and probe parts
  ssvd: : Stochastic SVD
  streamingkmeans: : Streaming k-means clustering
  svd: : Lanczos Singular Value Decomposition
  testnb: : Test the Vector-based Bayes classifier
  trainnb: : Train the Vector-based Bayes classifier
  transpose: : Take the transpose of a matrix
  vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
  viterbi: : Viterbi decoding of hidden states from given output states sequence


> Date: Fri, 20 Dec 2013 14:42:33 -0800
> From: suneel_marthi@yahoo.com
> Subject: Re: clusterdump
> To: user@mahout.apache.org
> 
> Are you working off of trunk? 'clusterdump' is being used in examples/bin/cluster-reuters.sh.
> 
> 
> 
> 
> 
> On Friday, December 20, 2013 5:33 PM, Sameer Tilak <ss...@live.com> wrote:
>  
> Hi All,
> I was able to do the clustering and need some help with viewing the result. I get the following problem.
> 
> ./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -d /scratch/dummyvectorfinalclusters
> MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
> Warning: $HADOOP_HOME is deprecated.
> 
> Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf
> MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
> Warning: $HADOOP_HOME is deprecated.
> 
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver
> Unknown program 'clusterdump' chosen.
> Valid program names are:
>   baumwelch: : Baum-Welch algorithm for unsupervised HMM training
>   canopy: : Canopy clustering
>   cleansvd: : Cleanup and verification of SVD output
>   clusterpp: : Groups Clustering Output In Clusters
>   cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx)
>   cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally.
>   evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes
>   fkmeans: : Fuzzy K-means clustering
>   hmmpredict: : Generate random sequence of observations by given HMM
>   itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
>   kmeans: : K-means clustering
>   matrixmult: : Take the product of two matrices
>   parallelALS: : ALS-WR factorization of a rating matrix
>   recommendfactorized: : Compute recommendations using the factorization of a rating matrix
>   recommenditembased: : Compute recommendations using item-based collaborative filtering
>   rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
>   seq2encoded: : Encoded Sparse Vector generation from Text sequence files
>   seq2sparse: : Sparse Vector generation from Text sequence files
>   spectralkmeans: : Spectral k-means clustering
>   splitDataset: : split a rating dataset into training and probe parts
>   ssvd: : Stochastic SVD
>   streamingkmeans: : Streaming k-means clustering
>   svd: : Lanczos Singular Value Decomposition
>   testnb: : Test the Vector-based Bayes classifier
>   trainnb: : Train the Vector-based Bayes classifier
>   transpose: : Take the transpose of a matrix
>   vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
>   viterbi: : Viterbi decoding of hidden states from given output states sequence                          

RE: clusterdump

Posted by Sameer Tilak <ss...@live.com>.
Suneel:
Yes, I am working off of trunk. I saw that example. In my case the data is numeric -- I assume that means no need for dictionary etc . I am not sure what is going on, but I still get the following errors:

./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -o /scratch/clusterdump
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Warning: $HADOOP_HOME is deprecated.

Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf
MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
Warning: $HADOOP_HOME is deprecated.

13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob
13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver
Unknown program 'clusterdump' chosen.
Valid program names are:
  baumwelch: : Baum-Welch algorithm for unsupervised HMM training
  canopy: : Canopy clustering
  cleansvd: : Cleanup and verification of SVD output
  clusterpp: : Groups Clustering Output In Clusters
  cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx)
  cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally.
  evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes
  fkmeans: : Fuzzy K-means clustering
  hmmpredict: : Generate random sequence of observations by given HMM
  itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
  kmeans: : K-means clustering
  matrixmult: : Take the product of two matrices
  parallelALS: : ALS-WR factorization of a rating matrix
  recommendfactorized: : Compute recommendations using the factorization of a rating matrix
  recommenditembased: : Compute recommendations using item-based collaborative filtering
  rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
  seq2encoded: : Encoded Sparse Vector generation from Text sequence files
  seq2sparse: : Sparse Vector generation from Text sequence files
  spectralkmeans: : Spectral k-means clustering
  splitDataset: : split a rating dataset into training and probe parts
  ssvd: : Stochastic SVD
  streamingkmeans: : Streaming k-means clustering
  svd: : Lanczos Singular Value Decomposition
  testnb: : Test the Vector-based Bayes classifier
  trainnb: : Train the Vector-based Bayes classifier
  transpose: : Take the transpose of a matrix
  vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
  viterbi: : Viterbi decoding of hidden states from given output states sequence

> Date: Fri, 20 Dec 2013 14:42:33 -0800
> From: suneel_marthi@yahoo.com
> Subject: Re: clusterdump
> To: user@mahout.apache.org
> 
> Are you working off of trunk? 'clusterdump' is being used in examples/bin/cluster-reuters.sh.
> 
> 
> 
> 
> 
> On Friday, December 20, 2013 5:33 PM, Sameer Tilak <ss...@live.com> wrote:
>  
> Hi All,
> I was able to do the clustering and need some help with viewing the result. I get the following problem.
> 
> ./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -d /scratch/dummyvectorfinalclusters
> MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
> Warning: $HADOOP_HOME is deprecated.
> 
> Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf
> MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
> Warning: $HADOOP_HOME is deprecated.
> 
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob
> 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver
> Unknown program 'clusterdump' chosen.
> Valid program names are:
>   baumwelch: : Baum-Welch algorithm for unsupervised HMM training
>   canopy: : Canopy clustering
>   cleansvd: : Cleanup and verification of SVD output
>   clusterpp: : Groups Clustering Output In Clusters
>   cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx)
>   cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally.
>   evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes
>   fkmeans: : Fuzzy K-means clustering
>   hmmpredict: : Generate random sequence of observations by given HMM
>   itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
>   kmeans: : K-means clustering
>   matrixmult: : Take the product of two matrices
>   parallelALS: : ALS-WR factorization of a rating matrix
>   recommendfactorized: : Compute recommendations using the factorization of a rating matrix
>   recommenditembased: : Compute recommendations using item-based collaborative filtering
>   rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
>   seq2encoded: : Encoded Sparse Vector generation from Text sequence files
>   seq2sparse: : Sparse Vector generation from Text sequence files
>   spectralkmeans: : Spectral k-means clustering
>   splitDataset: : split a rating dataset into training and probe parts
>   ssvd: : Stochastic SVD
>   streamingkmeans: : Streaming k-means clustering
>   svd: : Lanczos Singular Value Decomposition
>   testnb: : Test the Vector-based Bayes classifier
>   trainnb: : Train the Vector-based Bayes classifier
>   transpose: : Take the transpose of a matrix
>   vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
>   viterbi: : Viterbi decoding of hidden states from given output states sequence                           
 		 	   		  

Re: clusterdump

Posted by Suneel Marthi <su...@yahoo.com>.
Are you working off of trunk? 'clusterdump' is being used in examples/bin/cluster-reuters.sh.





On Friday, December 20, 2013 5:33 PM, Sameer Tilak <ss...@live.com> wrote:
 
Hi All,
I was able to do the clustering and need some help with viewing the result. I get the following problem.

./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -d /scratch/dummyvectorfinalclusters
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Warning: $HADOOP_HOME is deprecated.

Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf
MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
Warning: $HADOOP_HOME is deprecated.

13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob
13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver
Unknown program 'clusterdump' chosen.
Valid program names are:
  baumwelch: : Baum-Welch algorithm for unsupervised HMM training
  canopy: : Canopy clustering
  cleansvd: : Cleanup and verification of SVD output
  clusterpp: : Groups Clustering Output In Clusters
  cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx)
  cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally.
  evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes
  fkmeans: : Fuzzy K-means clustering
  hmmpredict: : Generate random sequence of observations by given HMM
  itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering
  kmeans: : K-means clustering
  matrixmult: : Take the product of two matrices
  parallelALS: : ALS-WR factorization of a rating matrix
  recommendfactorized: : Compute recommendations using the factorization of a rating matrix
  recommenditembased: : Compute recommendations using item-based collaborative filtering
  rowsimilarity: : Compute the pairwise similarities of the rows of a matrix
  seq2encoded: : Encoded Sparse Vector generation from Text sequence files
  seq2sparse: : Sparse Vector generation from Text sequence files
  spectralkmeans: : Spectral k-means clustering
  splitDataset: : split a rating dataset into training and probe parts
  ssvd: : Stochastic SVD
  streamingkmeans: : Streaming k-means clustering
  svd: : Lanczos Singular Value Decomposition
  testnb: : Test the Vector-based Bayes classifier
  trainnb: : Train the Vector-based Bayes classifier
  transpose: : Take the transpose of a matrix
  vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors
  viterbi: : Viterbi decoding of hidden states from given output states sequence