You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Aleksei Izmalkin (JIRA)" <ji...@apache.org> on 2018/09/24 07:55:00 UTC

[jira] [Commented] (FLINK-3133) Introduce collect()/count()/print() methods in DataStream API

    [ https://issues.apache.org/jira/browse/FLINK-3133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16625491#comment-16625491 ] 

Aleksei Izmalkin commented on FLINK-3133:
-----------------------------------------

Hello [~mxm],
I want to help with this issue. I read conversation history attentively. The last comment was left on August 24, 2017. It is more than year ago.
Is this issue still actual or it would be better to close this and open the new one with appropriate description?

> Introduce collect()/count()/print() methods in DataStream API
> -------------------------------------------------------------
>
>                 Key: FLINK-3133
>                 URL: https://issues.apache.org/jira/browse/FLINK-3133
>             Project: Flink
>          Issue Type: Improvement
>          Components: DataStream API
>    Affects Versions: 0.10.0, 0.10.1, 1.0.0
>            Reporter: Maximilian Michels
>            Assignee: Evgeny Kincharov
>            Priority: Major
>
> The DataSet API's methods {{collect()}}, {{count()}}, and {{print()}} should be mirrored to the DataStream API. 
> The semantics of the calls are different. We need to be able to sample parts of a stream, e.g. by supplying a time period in the arguments to the methods. Users should use the {{JobClient}} to retrieve the results.
> {code:java}
> StreamExecutionEnvironment env = StramEnvironment.getStreamExecutionEnvironment();
> DataStream<DataType> streamData = env.addSource(..).map(..);
> JobClient jobClient = env.executeWithControl();
> Iterable<DataType> sampled = jobClient.sampleStream(streamData, Time.seconds(5));
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)