Posted to user@mahout.apache.org by Terry Blankers <te...@amritanet.com> on 2014/04/18 19:29:13 UTC

Re: clusterdump samplePoints parameter

Can you please clarify whether the points are ordered in some way when 
the number of points is specified? In other words, suppose I set max 
points = 100 and there are 1000 points in a cluster. Which 100 of the 
1000 points are returned? Is it an alphanumeric sort by point ID, etc.?



On 3/18/14, 11:41 PM, Suneel Marthi wrote:
> It's the max. no. of points to include from each cluster in the clusterdump. If not specified, all points would be included.
>
> On Tuesday, March 18, 2014 11:25 PM, Terry Blankers <te...@amritanet.com> wrote:
>   
> Hi all,
>
> Can someone please answer a quick question about the --samplePoints
> parameter in the clusterdump utility? I understand it specifies the
> number of points returned per cluster. But are the points per cluster
> ordered or ranked in any way before this truncation occurs?
>
> Thanks,
>
> Terry