You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Delroy Cameron <de...@gmail.com> on 2010/05/28 23:57:42 UTC

Re: Kmeans clustering

indeed, for k-means clustering you should specify the path to the clusters
directory where the binary part-00000 file is, and not the actual binary
file as the input for the sequence file (-s).

that is
<path to clusters output>/clusters-<last iteration number>/
instead of 
<path to clusters output>/clusters-<last iteration number>/part-00000

clusterdump worked fine for me with the following command
./bin/mahout clusterdump 
-s <path to clusters output>/clusters-<last iteration number>/ \
-o <path for dump file> \
-p <path to clusters output>/clusteredPoints/ \
-d <path to input vectors>/dictionary.file-0 \
-dt sequencefile


-----
--cheers
Delroy
-- 
View this message in context: http://lucene.472066.n3.nabble.com/Kmeans-clustering-tp641973p853194.html
Sent from the Mahout User List mailing list archive at Nabble.com.