Posted to issues@flink.apache.org by "Chesnay Schepler (JIRA)" <ji...@apache.org> on 2015/01/28 10:56:34 UTC

[jira] [Commented] (FLINK-1459) Collect DataSet to client

    [ https://issues.apache.org/jira/browse/FLINK-1459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14294945#comment-14294945 ] 

Chesnay Schepler commented on FLINK-1459:
-----------------------------------------

There is currently a PR pending for this feature: https://github.com/apache/flink/pull/210

> Collect DataSet to client
> -------------------------
>
>                 Key: FLINK-1459
>                 URL: https://issues.apache.org/jira/browse/FLINK-1459
>             Project: Flink
>          Issue Type: Improvement
>            Reporter: John Sandiford
>
> Hi, I may well have missed something obvious here, but I cannot find an easy way to extract the values in a DataSet to the client. Spark has collect, collectAsMap, etc.
> (I need to pass the values from a small aggregated DataSet back to a machine learning library that is controlling the iterations.)
> The only way I could find to do this was to implement my own in-memory OutputFormat. This is not ideal, but it does work.
> Many thanks, John
>
> val env = ExecutionEnvironment.getExecutionEnvironment
> val data: DataSet[Double] = env.fromElements(1.0, 2.0, 3.0, 4.0)
> val result = data.reduce((a, b) => a)
> val valuesOnClient = result.???
> env.execute("Simple example")
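
For reference, the in-memory OutputFormat workaround mentioned in the issue might look roughly like the sketch below. This is not the reporter's actual code and not what the pending PR implements. The names InMemorySink, InMemoryOutputFormat and CollectWorkaround are hypothetical, and the sketch assumes the job runs in a local environment so that the sink task and the client share one JVM. On a real cluster the buffer would be filled on the workers and stay empty on the client.

```scala
import java.util.concurrent.ConcurrentLinkedQueue

import org.apache.flink.api.common.io.OutputFormat
import org.apache.flink.api.scala._
import org.apache.flink.configuration.Configuration

import scala.collection.JavaConverters._

// Hypothetical helper (not part of Flink): a JVM-wide buffer that sink tasks append to.
object InMemorySink {
  val values = new ConcurrentLinkedQueue[Double]()
}

// Hypothetical OutputFormat that copies every record into InMemorySink.values.
// Assumption: the job runs in the same JVM as the client (local execution).
class InMemoryOutputFormat extends OutputFormat[Double] {
  override def configure(parameters: Configuration): Unit = {}
  override def open(taskNumber: Int, numTasks: Int): Unit = {}
  override def writeRecord(record: Double): Unit = InMemorySink.values.add(record)
  override def close(): Unit = {}
}

object CollectWorkaround {
  def main(args: Array[String]): Unit = {
    // Local environment, so the sink runs inside this JVM.
    val env = ExecutionEnvironment.createLocalEnvironment()
    val data: DataSet[Double] = env.fromElements(1.0, 2.0, 3.0, 4.0)
    val result = data.reduce((a, b) => a)

    // Register the in-memory sink, then run the job.
    result.output(new InMemoryOutputFormat)
    env.execute("Simple example")

    // After execute() returns, the buffered values are available on the client.
    val valuesOnClient = InMemorySink.values.asScala.toList
    println(valuesOnClient)
  }
}
```

The shared static buffer is what makes this "not ideal": it ties the result to a single JVM and has to be cleared by hand between runs.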



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)