You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spark.apache.org by pw...@apache.org on 2014/11/10 21:40:44 UTC
spark git commit: SPARK-4230. Doc for spark.default.parallelism is
incorrect
Repository: spark
Updated Branches:
refs/heads/master c5db8e2c0 -> c6f4e7042
SPARK-4230. Doc for spark.default.parallelism is incorrect
Author: Sandy Ryza <sa...@cloudera.com>
Closes #3107 from sryza/sandy-spark-4230 and squashes the following commits:
37a1d19 [Sandy Ryza] Clear up a couple things
34d53de [Sandy Ryza] SPARK-4230. Doc for spark.default.parallelism is incorrect
Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/c6f4e704
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/c6f4e704
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/c6f4e704
Branch: refs/heads/master
Commit: c6f4e704214097f17d2d6abfbfef4bb208e4339f
Parents: c5db8e2
Author: Sandy Ryza <sa...@cloudera.com>
Authored: Mon Nov 10 12:40:41 2014 -0800
Committer: Patrick Wendell <pw...@gmail.com>
Committed: Mon Nov 10 12:40:41 2014 -0800
----------------------------------------------------------------------
docs/configuration.md | 7 +++++--
1 file changed, 5 insertions(+), 2 deletions(-)
----------------------------------------------------------------------
http://git-wip-us.apache.org/repos/asf/spark/blob/c6f4e704/docs/configuration.md
----------------------------------------------------------------------
diff --git a/docs/configuration.md b/docs/configuration.md
index 0f9eb81..f0b396e 100644
--- a/docs/configuration.md
+++ b/docs/configuration.md
@@ -562,6 +562,9 @@ Apart from these, the following properties are also available, and may be useful
<tr>
<td><code>spark.default.parallelism</code></td>
<td>
+ For distributed shuffle operations like <code>reduceByKey</code> and <code>join</code>, the
+ largest number of partitions in a parent RDD. For operations like <code>parallelize</code>
+ with no parent RDDs, it depends on the cluster manager:
<ul>
<li>Local mode: number of cores on the local machine</li>
<li>Mesos fine grained mode: 8</li>
@@ -569,8 +572,8 @@ Apart from these, the following properties are also available, and may be useful
</ul>
</td>
<td>
- Default number of tasks to use across the cluster for distributed shuffle operations
- (<code>groupByKey</code>, <code>reduceByKey</code>, etc) when not set by user.
+ Default number of partitions in RDDs returned by transformations like <code>join</code>,
+ <code>reduceByKey</code>, and <code>parallelize</code> when not set by user.
</td>
</tr>
<tr>
---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@spark.apache.org
For additional commands, e-mail: commits-help@spark.apache.org