You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2015/01/07 20:26:36 UTC
[jira] [Commented] (SPARK-5137) subtract does not take the
spark.default.parallelism into account
[ https://issues.apache.org/jira/browse/SPARK-5137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14268061#comment-14268061 ]
Sean Owen commented on SPARK-5137:
----------------------------------
(You mean {{subtractByKey}}? that's the one in {{PairRDDFunctions}}. I think this applies to both.)
It uses the parallelism of 'self' by default, which seems like a good idea. Forcing the default parallelism by default could mean a pointless shuffle. You can override it if you need to. What should change?
> subtract does not take the spark.default.parallelism into account
> -----------------------------------------------------------------
>
> Key: SPARK-5137
> URL: https://issues.apache.org/jira/browse/SPARK-5137
> Project: Spark
> Issue Type: Bug
> Affects Versions: 1.2.0
> Environment: CENTOS 6; scala
> Reporter: Al M
> Priority: Trivial
>
> The 'subtract' function (PairRDDFunctions.scala) in scala does not use the default parallelism value set in the config (spark.default.parallelism). This is easy enough to work around. I can just load the property and pass it in as an argument.
> It would be great if subtract used the default value, just like all the other PairRDDFunctions.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org