You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Sznajder ForMailingList <bs...@gmail.com> on 2014/02/02 13:32:02 UTC

Meaning of seqdump output on a cluster file

Hi,

I am using Mahout0.5 (the version corresponding to the mahout in action
book)

I ran a K-means clustering and ran then seqdump on the clusters file.
here is an output sample

Input Path: log-kmeans-clusters-monogram-sim_0_1/clusters-9/part-r-00000
Key class: class org.apache.hadoop.io.Text Value Class: class
org.apache.mahout.clustering.kmeans.Cluster
Key: VL-513: Value: VL-513{n=26 c=[72:0.308, 404:0.354, ....


What is please the meaning of the number 72, 404 etc...

Can I map them to the initial document text?

Benjamin