You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2015/01/07 20:26:36 UTC

[jira] [Commented] (SPARK-5137) subtract does not take the spark.default.parallelism into account

    [ https://issues.apache.org/jira/browse/SPARK-5137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14268061#comment-14268061 ] 

Sean Owen commented on SPARK-5137:
----------------------------------

(You mean {{subtractByKey}}? that's the one in {{PairRDDFunctions}}. I think this applies to both.)

It uses the parallelism of 'self' by default, which seems like a good idea. Forcing the default parallelism by default could mean a pointless shuffle. You can override it if you need to. What should change?

> subtract does not take the spark.default.parallelism into account
> -----------------------------------------------------------------
>
>                 Key: SPARK-5137
>                 URL: https://issues.apache.org/jira/browse/SPARK-5137
>             Project: Spark
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>         Environment: CENTOS 6; scala
>            Reporter: Al M
>            Priority: Trivial
>
> The 'subtract' function (PairRDDFunctions.scala) in scala does not use the default parallelism value set in the config (spark.default.parallelism).  This is easy enough to work around.  I can just load the property and pass it in as an argument.
> It would be great if subtract used the default value, just like all the other PairRDDFunctions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org