You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hama.apache.org by Apache Wiki <wi...@apache.org> on 2013/04/20 14:56:11 UTC

[Hama Wiki] Update of "Partitioning" by edwardyoon

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hama Wiki" for change notification.

The "Partitioning" page has been changed by edwardyoon:
http://wiki.apache.org/hama/Partitioning?action=diff&rev1=12&rev2=13

  == Partition Function ==
+ 
+  * '''NOTE: if when the number of splits exceeds the maximum number of tasks?'''.
  
  In Hama BSP computing framework, the Partition function is used for obtaining scalability of a Bulk Synchronous Parallel processing, and determining how to distribute the slices of input data among BSP processors. Unlike Map/Reduce data processing model, many scientific algorithms based on Message-Passing Bulk Synchronous Parallel model often requires that a processor obtain “nearby or related” data from other processors in order to complete the computation. In this case, you can create your own Partition function for determining processor inter-communication and how to distribute the data.
  
@@ -27, +29 @@

  
  === Specify the partition files and directories ===
  
- 
- 
  If the input is already partitioned, you can skip pre-partitioning step as following configuration:
  
  {{{