You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2016/12/14 18:03:58 UTC

[jira] [Resolved] (SPARK-18858) reduceByKey not avaiable on Dataset

     [ https://issues.apache.org/jira/browse/SPARK-18858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen resolved SPARK-18858.
-------------------------------
    Resolution: Invalid

Questions should really go to user@spark.apache.org. The Dataset API is different. I think you will end up using groupBy() and something like flatMapGroups() to achieve what you want.

> reduceByKey not avaiable on Dataset
> -----------------------------------
>
>                 Key: SPARK-18858
>                 URL: https://issues.apache.org/jira/browse/SPARK-18858
>             Project: Spark
>          Issue Type: Bug
>    Affects Versions: 2.0.2
>            Reporter: Jorge Machado
>            Priority: Minor
>
> Hi, 
> I donĀ“t really know if this is a bug or not.
> But having a Dataset it should be possible to do reduceByKey or not ? 
> at the moment I have done the workaround with ds.rdd.ReduceByKey.
> reduce does not have the same signature



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org