You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Delroy Cameron <de...@gmail.com> on 2010/05/28 23:57:42 UTC
Re: Kmeans clustering
indeed, for k-means clustering you should specify the path to the clusters
directory where the binary part-00000 file is, and not the actual binary
file as the input for the sequence file (-s).
that is
<path to clusters output>/clusters-<last iteration number>/
instead of
<path to clusters output>/clusters-<last iteration number>/part-00000
clusterdump worked fine for me with the following command
./bin/mahout clusterdump
-s <path to clusters output>/clusters-<last iteration number>/ \
-o <path for dump file> \
-p <path to clusters output>/clusteredPoints/ \
-d <path to input vectors>/dictionary.file-0 \
-dt sequencefile
-----
--cheers
Delroy
--
View this message in context: http://lucene.472066.n3.nabble.com/Kmeans-clustering-tp641973p853194.html
Sent from the Mahout User List mailing list archive at Nabble.com.