You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by myn <my...@163.com> on 2011/09/13 18:15:56 UTC

why so many place does`t set job.setNumReduceTasks

  private static void startDFCounting(Path input, Path output, Configuration baseConf,int numReducers)

  private static void makePartialVectors(Path input,

meanshift cluster
and so on....
 
and so many place ,why?  hadoop default is 2 reduce, but my data is 3 billon ,2 reduce is so slowly.