Posted to user@mahout.apache.org by Bikash Gupta <bi...@gmail.com> on 2014/02/14 18:55:23 UTC

Approach to map the clustering output with existing data indexes

I am new to Mahout and am working on a data mining clustering use case
using K-Means. I need help understanding how to map the original
data to the clustered output to gain more insight. For example:

After data preparation we have a summarized data set with the
following attributes:

Key1,Key2,Dimension1,Dimension2,Measure1,Measure2,Measure3

I then ran the clustering algorithm on only the following attributes:

Measure1,Measure2,Measure3

The output of the clustering is a cluster ID together with its
data (Measure1,Measure2,Measure3).

Question: How can I cluster on specific attributes of the data set
while having the clustered output contain all attributes?
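
To make the desired result concrete, here is a minimal sketch in plain
Java. It does not use any Mahout API. The class name, the method names,
the sample rows and the centroid values are all hypothetical, and the
fixed centroids only stand in for whatever K-Means would produce. It
computes distances on the three measure columns only, but carries the
whole original row along so the cluster ID can be attached to it.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ClusterJoinSketch {

    // Rows are laid out as
    // Key1,Key2,Dimension1,Dimension2,Measure1,Measure2,Measure3
    // so the measures start at column index 4.
    static final int FIRST_MEASURE = 4;

    // Extract only the measure columns as the feature vector.
    static double[] measures(String[] row) {
        double[] m = new double[row.length - FIRST_MEASURE];
        for (int i = 0; i < m.length; i++) {
            m[i] = Double.parseDouble(row[FIRST_MEASURE + i]);
        }
        return m;
    }

    // Index of the centroid closest to the point (squared Euclidean distance).
    static int nearest(double[] point, double[][] centroids) {
        int best = 0;
        double bestDist = Double.MAX_VALUE;
        for (int c = 0; c < centroids.length; c++) {
            double d = 0;
            for (int i = 0; i < point.length; i++) {
                double diff = point[i] - centroids[c][i];
                d += diff * diff;
            }
            if (d < bestDist) {
                bestDist = d;
                best = c;
            }
        }
        return best;
    }

    // Prefix every original row with its cluster ID; keys and
    // dimensions are kept but never used in the distance.
    static List<String> assign(List<String> csvRows, double[][] centroids) {
        List<String> out = new ArrayList<>();
        for (String line : csvRows) {
            String[] row = line.split(",");
            int clusterId = nearest(measures(row), centroids);
            out.add(clusterId + "," + line);
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample data and centroids, for illustration only.
        List<String> rows = Arrays.asList(
            "k1,a,d1,d2,1.0,2.0,3.0",
            "k2,b,d1,d3,10.0,11.0,12.0");
        double[][] centroids = { {1, 2, 3}, {10, 10, 10} };
        for (String line : assign(rows, centroids)) {
            System.out.println(line);
        }
    }
}
```

Each output line is the cluster ID followed by all seven original
columns, which is the shape of result I would like to get from Mahout.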

Please help me with the right approach.

-- 
Regards
Bikash Gupta