You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hama.apache.org by Apache Wiki <wi...@apache.org> on 2013/04/20 14:56:11 UTC
[Hama Wiki] Update of "Partitioning" by edwardyoon
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Hama Wiki" for change notification.
The "Partitioning" page has been changed by edwardyoon:
http://wiki.apache.org/hama/Partitioning?action=diff&rev1=12&rev2=13
== Partition Function ==
+
+ * '''NOTE: if when the number of splits exceeds the maximum number of tasks?'''.
In Hama BSP computing framework, the Partition function is used for obtaining scalability of a Bulk Synchronous Parallel processing, and determining how to distribute the slices of input data among BSP processors. Unlike Map/Reduce data processing model, many scientific algorithms based on Message-Passing Bulk Synchronous Parallel model often requires that a processor obtain “nearby or related” data from other processors in order to complete the computation. In this case, you can create your own Partition function for determining processor inter-communication and how to distribute the data.
@@ -27, +29 @@
=== Specify the partition files and directories ===
-
-
If the input is already partitioned, you can skip pre-partitioning step as following configuration:
{{{