You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Markus Holzemer (JIRA)" <ji...@apache.org> on 2014/06/30 11:13:24 UTC
[jira] [Assigned] (FLINK-760) Introduce a DataSet.distinct()
operator
[ https://issues.apache.org/jira/browse/FLINK-760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Markus Holzemer reassigned FLINK-760:
-------------------------------------
Assignee: Markus Holzemer
> Introduce a DataSet.distinct() operator
> ---------------------------------------
>
> Key: FLINK-760
> URL: https://issues.apache.org/jira/browse/FLINK-760
> Project: Flink
> Issue Type: Improvement
> Reporter: GitHub Import
> Assignee: Markus Holzemer
> Labels: github-import
> Fix For: pre-apache
>
>
> I think it would be more readable and user-friendly to have a DataSet.distinct() operator instead of manually use e.g. for Tuple5:
> ```java
> DataSet.groupBy(0,1,2,3,4).reduceGroup(new GroupReduceFunction<...>() {
> private static final long serialVersionUID = 1L;
> @Override
> public void reduce(Iterator<...> values, Collector<...> out) throws Exception {
> out.collect(values.next());
> }
> });
> ```
> ---------------- Imported from GitHub ----------------
> Url: https://github.com/stratosphere/stratosphere/issues/760
> Created by: [twalthr|https://github.com/twalthr]
> Labels: enhancement, java api,
> Milestone: Release 0.6 (unplanned)
> Created at: Tue May 06 11:28:02 CEST 2014
> State: open
--
This message was sent by Atlassian JIRA
(v6.2#6252)