Posted to common-commits@hadoop.apache.org by Apache Wiki <wi...@apache.org> on 2006/07/06 01:25:24 UTC

[Lucene-hadoop Wiki] Update of "Sort" by OwenOMalley

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by OwenOMalley:
http://wiki.apache.org/lucene-hadoop/Sort

------------------------------------------------------------------------------
  To run the program:[[BR]]
  bin/hadoop jar hadoop-*-examples.jar sort [-m <#maps>] [-r <#reduces>] <in-dir> <out-dir>
  
+ == Running Sort Benchmark ==
+ 
+ To use the sort example as a benchmark, generate 10GB of random data per node using RandomWriter, then sort that data using ["Sort"]. This gives a sort benchmark that scales with the size of the cluster. By default, the ["Sort"] program uses 1.0 * capacity for the number of reduces; depending on your cluster, you may see better results at 1.75 * capacity.
+
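
As a rough illustration of the sizing arithmetic described in the change above, here is a minimal Python sketch. It is not part of the wiki page. It assumes that "capacity" means the total number of reduce tasks the cluster can run at once (nodes times reduce slots per node); the function names, the example cluster size, and the directory names are hypothetical. Only the `sort -r` usage comes from the page itself.

```python
import math

GB_PER_NODE = 10  # from the page: generate 10GB of random data per node


def benchmark_plan(nodes, reduce_slots_per_node, factor=1.0):
    """Return (total_gb, reduces) for a cluster of the given size.

    Assumption: "capacity" is the number of reduce tasks the cluster
    can run at once, i.e. nodes * reduce_slots_per_node.
    """
    capacity = nodes * reduce_slots_per_node
    # Round down, but always ask for at least one reduce.
    reduces = max(1, math.floor(factor * capacity))
    total_gb = nodes * GB_PER_NODE
    return total_gb, reduces


def sort_command(reduces, in_dir, out_dir):
    """Build the sort invocation using the usage line quoted on the page."""
    return (
        "bin/hadoop jar hadoop-*-examples.jar sort "
        f"-r {reduces} {in_dir} {out_dir}"
    )


if __name__ == "__main__":
    # Hypothetical cluster: 20 nodes, 2 reduce slots each.
    for factor in (1.0, 1.75):
        total_gb, reduces = benchmark_plan(20, 2, factor)
        print(f"factor {factor}: {total_gb}GB input, {reduces} reduces")
        print(sort_command(reduces, "rand", "rand-sort"))
```

Running the benchmark once with each factor and comparing wall-clock times is one way to check whether the 1.75 setting helps on a given cluster.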