You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Patrick Wendell (JIRA)" <ji...@apache.org> on 2014/06/12 17:17:01 UTC

[jira] [Updated] (SPARK-554) Add aggregateByKey

     [ https://issues.apache.org/jira/browse/SPARK-554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Patrick Wendell updated SPARK-554:
----------------------------------

    Summary: Add aggregateByKey  (was: Add foldByKey and aggregateByKey)

> Add aggregateByKey
> ------------------
>
>                 Key: SPARK-554
>                 URL: https://issues.apache.org/jira/browse/SPARK-554
>             Project: Spark
>          Issue Type: New Feature
>            Reporter: Matei Zaharia
>             Fix For: 1.1.0
>
>
> Similar to the new fold() and aggregate() methods in #95, we should have foldByKey and aggregateByKey for pair RDDs. The main thing that makes this slightly harder is that we'll have to change the combineByKey API and ShuffledRDD to allow taking in a "zero value".



--
This message was sent by Atlassian JIRA
(v6.2#6252)