Posted to dev@mahout.apache.org by myn <my...@163.com> on 2011/09/13 18:15:56 UTC
Why do so many places not set job.setNumReduceTasks?
private static void startDFCounting(Path input, Path output, Configuration baseConf,int numReducers)
private static void makePartialVectors(Path input,
meanshift cluster
and so on....
And many other places. Why? The Hadoop default is 2 reducers, but my data is 3 billion, and 2 reducers is far too slow.