You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/11/11 15:40:00 UTC

[jira] [Updated] (FLINK-14709) Allow outputting elements in close method of chained drivers.

     [ https://issues.apache.org/jira/browse/FLINK-14709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated FLINK-14709:
-----------------------------------
    Labels: pull-request-available  (was: )

> Allow outputting elements in close method of chained drivers.
> -------------------------------------------------------------
>
>                 Key: FLINK-14709
>                 URL: https://issues.apache.org/jira/browse/FLINK-14709
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Task
>    Affects Versions: 1.7.2, 1.8.1, 1.9.1
>            Reporter: David Moravek
>            Priority: Major
>              Labels: pull-request-available
>
> Currently, BatchTask and DataSourceTask only allow outputting elements in close method of "rich" operators, that they directly execute.
> Task workflow is as follows:
> 1) open "head" driver  (calls "open" method on udf)
> 2) open chained drivers
> 3) run "head" driver
> 4) close "head" driver (calls "close" method on udf)
> 5) close output collector (no elements can be collected after this point)
> 6) close chained drivers
> In order to properly support outputs from close method, we want to switch 6) and 5). We also need to tweak implementation of Reduce / Combine chained drivers, because they dispose sorters in closeTask method (this should be done in the close method).
> This would bring huge performance improvement for Beam users, because we could properly implement bundling on batch (whole partition = single bundle).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)