Posted to issues@spark.apache.org by "Sandy Ryza (JIRA)" <ji...@apache.org> on 2014/05/10 03:12:13 UTC

[jira] [Commented] (SPARK-554) Add foldByKey and aggregateByKey

    [ https://issues.apache.org/jira/browse/SPARK-554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13993294#comment-13993294 ] 

Sandy Ryza commented on SPARK-554:
----------------------------------

https://github.com/apache/spark/pull/705

> Add foldByKey and aggregateByKey
> --------------------------------
>
>                 Key: SPARK-554
>                 URL: https://issues.apache.org/jira/browse/SPARK-554
>             Project: Spark
>          Issue Type: New Feature
>            Reporter: Matei Zaharia
>
> Similar to the new fold() and aggregate() methods in #95, we should have foldByKey and aggregateByKey for pair RDDs. The main thing that makes this slightly harder is that we'll have to change the combineByKey API and ShuffledRDD to allow taking in a "zero value".
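
For readers unfamiliar with the proposed semantics, here is a minimal sketch in plain Python, not Spark code and not the implementation from the pull request. It models an RDD of pairs as a list of partitions, and the names aggregate_by_key, fold_by_key, and partitions are hypothetical, chosen only for illustration. It shows why a "zero value" is needed: each partition starts a fresh accumulator per key from the zero value, and the per-partition results are merged afterwards.

```python
from copy import deepcopy
from operator import add


def aggregate_by_key(partitions, zero, seq_op, comb_op):
    """Illustrative model of per-key aggregation with a zero value.

    partitions: list of lists of (key, value) pairs (a stand-in for a pair RDD).
    zero:       initial accumulator, copied once per key per partition.
    seq_op:     folds one value into an accumulator within a partition.
    comb_op:    merges two accumulators coming from different partitions.
    """
    partials = []
    for part in partitions:
        acc = {}
        for key, value in part:
            if key not in acc:
                # Copy the zero value so a mutable zero is never shared.
                acc[key] = deepcopy(zero)
            acc[key] = seq_op(acc[key], value)
        partials.append(acc)

    # Merge step (assumption: this stands in for what happens after a shuffle).
    merged = {}
    for acc in partials:
        for key, partial in acc.items():
            merged[key] = comb_op(merged[key], partial) if key in merged else partial
    return merged


def fold_by_key(partitions, zero, op):
    # Folding is the special case where both operations are the same function.
    return aggregate_by_key(partitions, zero, op, op)


# Two hypothetical partitions of (key, value) pairs.
partitions = [[("a", 1), ("b", 2)], [("a", 3), ("a", 4), ("b", 5)]]

# Per-key sum.
sums = fold_by_key(partitions, 0, add)

# Per-key (sum, count), where the accumulator type differs from the value type.
stats = aggregate_by_key(
    partitions,
    (0, 0),
    lambda acc, v: (acc[0] + v, acc[1] + 1),
    lambda x, y: (x[0] + y[0], x[1] + y[1]),
)
```

In this sketch the zero value is applied once per key in each partition where the key appears, so it should be neutral for the operation (0 for addition, an empty list for concatenation); otherwise the result would depend on how the data is partitioned.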



--
This message was sent by Atlassian JIRA
(v6.2#6252)