You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Stephan Ewen (JIRA)" <ji...@apache.org> on 2014/07/29 14:57:39 UTC
[jira] [Commented] (FLINK-1038) Adding a collection output format
[ https://issues.apache.org/jira/browse/FLINK-1038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14077678#comment-14077678 ]
Stephan Ewen commented on FLINK-1038:
-------------------------------------
That would be very nice!
> Adding a collection output format
> ---------------------------------
>
> Key: FLINK-1038
> URL: https://issues.apache.org/jira/browse/FLINK-1038
> Project: Flink
> Issue Type: Improvement
> Reporter: Sebastian Kruse
> Priority: Minor
>
> Similar to the existing LocalCollectionOutputFormat or Spark's collect() method, it would be nice to have a CollectionOutputFormat that also works when running jobs on a cluster. This output format gathers all results of a sink from all TaskManagers in the JVM that submitted the job plan and provides these as a collection, similar to accumulators. After all, this can help to avoid the tedious task of going to HDFS and read and parse the single result files.
> PS. We have already created such an output format and can contribute it.
--
This message was sent by Atlassian JIRA
(v6.2#6252)