You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Aljoscha Krettek (Jira)" <ji...@apache.org> on 2019/11/21 09:43:01 UTC

[jira] [Closed] (FLINK-14862) Fuse initalizeState and open passes in StreamTask#invoke

     [ https://issues.apache.org/jira/browse/FLINK-14862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aljoscha Krettek closed FLINK-14862.
------------------------------------
    Fix Version/s: 1.10.0
       Resolution: Fixed

Merged on master in cbe0f148f7cc6562a697b8b54faf410831c5f70b

> Fuse initalizeState and open passes in StreamTask#invoke
> --------------------------------------------------------
>
>                 Key: FLINK-14862
>                 URL: https://issues.apache.org/jira/browse/FLINK-14862
>             Project: Flink
>          Issue Type: New Feature
>          Components: Runtime / Task
>            Reporter: Igal Shilman
>            Assignee: Igal Shilman
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 1.10.0
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> Currently chained operators lack the ability to send records downstream during initializeState (AbstractStreamOperator#initalizeState), because downstream operators in the chain didn't open yet (AbstractStreamOperator#open).
> The main problem with that is that the rawOperatorState and the rawKeyedStateInputs are only valid during the execution of initializeState, and with large raw states materializing to hand them over to be used at open is not feasible (at least without spilling to disk)
> So, I would propose to fuse the loops here:
> https://github.com/apache/flink/blob/master/flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamTask.java#L435,L436
> Into a single loop that calls initalizeState and immediately open.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)