You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2016/12/14 18:03:58 UTC
[jira] [Resolved] (SPARK-18858) reduceByKey not avaiable on Dataset
[ https://issues.apache.org/jira/browse/SPARK-18858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen resolved SPARK-18858.
-------------------------------
Resolution: Invalid
Questions should really go to user@spark.apache.org. The Dataset API is different. I think you will end up using groupBy() and something like flatMapGroups() to achieve what you want.
> reduceByKey not avaiable on Dataset
> -----------------------------------
>
> Key: SPARK-18858
> URL: https://issues.apache.org/jira/browse/SPARK-18858
> Project: Spark
> Issue Type: Bug
> Affects Versions: 2.0.2
> Reporter: Jorge Machado
> Priority: Minor
>
> Hi,
> I donĀ“t really know if this is a bug or not.
> But having a Dataset it should be possible to do reduceByKey or not ?
> at the moment I have done the workaround with ds.rdd.ReduceByKey.
> reduce does not have the same signature
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org