You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by Grant Ingersoll <gs...@apache.org> on 2011/11/03 16:20:24 UTC

Fwd: Minhash key groups

Anyone know this?

Begin forwarded message:

> From: Grant Ingersoll <gs...@apache.org>
> Subject: Minhash key groups
> Date: November 2, 2011 10:20:57 AM EDT
> To: user@mahout.apache.org
> 
> What's the Minhash key groups value used for in the MinhashDriver?  I mean, I see it is used for building up the key out of the hashed values, but what's the significance of different values for it?  The default is 2, what does it mean practically speaking if I choose, say, 10?  AFAICT, it would mean that I would have more clusters, assuming that we still meet the minimum cluster size imposed by the reducer?
> 
> Thanks,
> Grant