You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2016/03/23 02:23:25 UTC

[jira] [Assigned] (SPARK-14088) Some Dataset API touch-up

     [ https://issues.apache.org/jira/browse/SPARK-14088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-14088:
------------------------------------

    Assignee: Reynold Xin  (was: Apache Spark)

> Some Dataset API touch-up
> -------------------------
>
>                 Key: SPARK-14088
>                 URL: https://issues.apache.org/jira/browse/SPARK-14088
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Reynold Xin
>            Assignee: Reynold Xin
>             Fix For: 2.0.0
>
>
> 1. Deprecated unionAll. It is pretty confusing to have both "union" and "unionAll" when the two do the same thing in Spark but are different in SQL.
> 2. Rename reduce in KeyValueGroupedDataset to reduceGroups so it is more consistent with rest of the functions in KeyValueGroupedDataset. Also makes it more obvious what "reduce" and "reduceGroups" mean. Previously it was confusing because it could be reducing a Dataset, or just reducing groups.
> 3. Added a "name" function, which is more natural to name columns than "as" for non-SQL users.
> 4. Remove "subtract" function since it is just an alias for "except".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org