You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Fabian Hueske (JIRA)" <ji...@apache.org> on 2016/01/14 21:27:39 UTC
[jira] [Updated] (FLINK-87) Extend collectors to specify target
sink
[ https://issues.apache.org/jira/browse/FLINK-87?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Fabian Hueske updated FLINK-87:
-------------------------------
Component/s: (was: Java API)
DataSet API
> Extend collectors to specify target sink
> ----------------------------------------
>
> Key: FLINK-87
> URL: https://issues.apache.org/jira/browse/FLINK-87
> Project: Flink
> Issue Type: Improvement
> Components: DataSet API
> Reporter: Robert Metzger
> Labels: github-import
> Fix For: pre-apache
>
>
> This is an enhancement proposal for Stratosphere.
> It is possible to have multiple outputs for a given PACT, like the figure below illustrates:
> (SRC)
> |
> REDUCE
> / | \
> +-----+ | +-------+
> | | |
> (SINK A) | (SINK C)
> (SINK B)
> All records are going to all sinks.
> It would be preferable sometimes to select a sink for a record .. For example each group into a separate file (yes, I know, one could add a filter before each sink)
> We could add another collect() method like this
> public void collect(int sinkId, PactRecord record)
> The current collect() is quite simple:
> public void collect(PactRecord record)
> {
> for (int i = 0; i < writers.length; i++) {
> this.writers[i].emit(record);
> }
> }
> ---------------- Imported from GitHub ----------------
> Url: https://github.com/stratosphere/stratosphere/issues/87
> Created by: [rmetzger|https://github.com/rmetzger]
> Labels: enhancement,
> Assignee: [rmetzger|https://github.com/rmetzger]
> Created at: Wed Sep 11 17:49:47 CEST 2013
> State: open
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)