You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@flink.apache.org by rm...@apache.org on 2015/04/22 15:39:28 UTC

[3/3] flink git commit: [FLINK-1875] Add figure explaining slots and parallelism

[FLINK-1875] Add figure explaining slots and parallelism

This closes #604


Project: http://git-wip-us.apache.org/repos/asf/flink/repo
Commit: http://git-wip-us.apache.org/repos/asf/flink/commit/db608332
Tree: http://git-wip-us.apache.org/repos/asf/flink/tree/db608332
Diff: http://git-wip-us.apache.org/repos/asf/flink/diff/db608332

Branch: refs/heads/master
Commit: db608332e71b7202dde31180fe9f543eb7e5cca7
Parents: acd9235
Author: Robert Metzger <rm...@apache.org>
Authored: Thu Apr 16 09:36:33 2015 +0200
Committer: Robert Metzger <rm...@apache.org>
Committed: Wed Apr 22 15:25:13 2015 +0200

----------------------------------------------------------------------
 docs/config.md                 |   7 +-
 docs/faq.md                    |   5 +-
 docs/img/slots_parallelism.svg | 695 ++++++++++++++++++++++++++++++++++++
 pom.xml                        |   3 +-
 4 files changed, 702 insertions(+), 8 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/flink/blob/db608332/docs/config.md
----------------------------------------------------------------------
diff --git a/docs/config.md b/docs/config.md
index 251ce03..1068152 100644
--- a/docs/config.md
+++ b/docs/config.md
@@ -370,8 +370,7 @@ system, such as */tmp* in Linux systems.
 
 ### Configuring TaskManager processing slots
 
-A processing slot allows Flink to execute a distributed DataSet transformation, such as a
-data source or a map-transformation.
+Flink executes a program in parallel by splitting it into subtasks and scheduling these subtasks to processing slots.
 
 Each Flink TaskManager provides processing slots in the cluster. The number of slots
 is typically proportional to the number of available CPU cores __of each__ TaskManager.
@@ -383,6 +382,4 @@ The command line value therefore is called `-p` (for parallelism). In addition,
 to [set the number of slots in the programming APIs](programming_guide.html#parallel-execution) for 
 the whole application and individual operators.
 
-Flink is currently scheduling an application to slots by "filling" them up. 
-If the cluster has 20 machines with 2 slots each (40 slots in total) but the application is running
-with a parallelism of 20, only 10 machines will process data.
+<img src="img/slots_parallelism.svg" class="img-responsive" />
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/flink/blob/db608332/docs/faq.md
----------------------------------------------------------------------
diff --git a/docs/faq.md b/docs/faq.md
index 5c31bc6..c7794a1 100644
--- a/docs/faq.md
+++ b/docs/faq.md
@@ -103,8 +103,9 @@ parallelism has to be 1 and set it accordingly.
 
 The parallelism can be set in numerous ways to ensure a fine-grained control
 over the execution of a Flink program. See
-[Configuration](config.html#common-options) for detailed instructions on how to
-set the parallelism.
+the [Configuration guide](config.html#common-options) for detailed instructions on how to
+set the parallelism. Also check out [this figure](config.html#configuring-taskmanager-processing-slots) detailing 
+how the processing slots and parallelism are related to each other.
 
 ## Errors