Posted to issues@flink.apache.org by "Sebastian Kruse (JIRA)" <ji...@apache.org> on 2014/07/29 14:47:38 UTC
[jira] [Created] (FLINK-1038) Adding a collection output format
Sebastian Kruse created FLINK-1038:
--------------------------------------
Summary: Adding a collection output format
Key: FLINK-1038
URL: https://issues.apache.org/jira/browse/FLINK-1038
Project: Flink
Issue Type: Improvement
Reporter: Sebastian Kruse
Priority: Minor
Similar to the existing LocalCollectionOutputFormat or Spark's collect() method, it would be nice to have a CollectionOutputFormat that also works when running jobs on a cluster. This output format gathers the results of a sink from all TaskManagers into the JVM that submitted the job plan and provides them as a collection, similar to accumulators. This avoids the tedious task of going to HDFS to read and parse the individual result files.
PS. We have already created such an output format and can contribute it.
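To illustrate the idea only, here is a rough sketch in plain Java with no Flink dependency. It is not the implementation mentioned above. The class and method names (CollectionSinkSketch, runSinkTask, mergeOnClient, collect) are hypothetical, threads stand in for TaskManagers, and Java serialization stands in for whatever transport a real output format would use to ship results back to the submitting JVM.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class CollectionSinkSketch {

    // Hypothetical stand-in for one parallel sink task: it serializes the
    // records of its partition so they can be shipped to the client.
    static byte[] runSinkTask(List<String> partition) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(new ArrayList<>(partition));
        }
        return bytes.toByteArray();
    }

    // Hypothetical client-side step: deserialize each shipped partition and
    // merge everything into a single collection in the submitting JVM.
    @SuppressWarnings("unchecked")
    static List<String> mergeOnClient(List<byte[]> shipped)
            throws IOException, ClassNotFoundException {
        List<String> result = new ArrayList<>();
        for (byte[] part : shipped) {
            try (ObjectInputStream in =
                     new ObjectInputStream(new ByteArrayInputStream(part))) {
                result.addAll((List<String>) in.readObject());
            }
        }
        return result;
    }

    // Runs one simulated sink task per partition in parallel, then merges
    // the results in partition order.
    static List<String> collect(List<List<String>> partitions) throws Exception {
        ExecutorService pool =
            Executors.newFixedThreadPool(Math.max(1, partitions.size()));
        try {
            List<Future<byte[]>> futures = new ArrayList<>();
            for (List<String> partition : partitions) {
                futures.add(pool.submit(() -> runSinkTask(partition)));
            }
            List<byte[]> shipped = new ArrayList<>();
            for (Future<byte[]> future : futures) {
                shipped.add(future.get());
            }
            return mergeOnClient(shipped);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        List<List<String>> partitions = Arrays.asList(
            Arrays.asList("a", "b"),
            Arrays.asList("c"),
            Arrays.asList("d", "e"));
        System.out.println(collect(partitions)); // prints [a, b, c, d, e]
    }
}
```

As with collect() in Spark, the whole result has to fit into the memory of the submitting JVM, so an approach like this is only suitable for small result sets.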