You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Huang Xingbo (Jira)" <ji...@apache.org> on 2022/05/19 12:25:00 UTC

[jira] [Assigned] (FLINK-27676) Output records from on_timer are behind the triggering watermark in PyFlink

     [ https://issues.apache.org/jira/browse/FLINK-27676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Huang Xingbo reassigned FLINK-27676:
------------------------------------

    Assignee: Juntao Hu

> Output records from on_timer are behind the triggering watermark in PyFlink
> ---------------------------------------------------------------------------
>
>                 Key: FLINK-27676
>                 URL: https://issues.apache.org/jira/browse/FLINK-27676
>             Project: Flink
>          Issue Type: Bug
>          Components: API / Python
>    Affects Versions: 1.15.0, 1.14.4
>            Reporter: Juntao Hu
>            Assignee: Juntao Hu
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.16.0, 1.14.5, 1.15.1
>
>
> Currently, when dealing with watermarks in AbstractPythonFunctionOperator, super.processWatermark(mark) is called, which advances watermark in timeServiceManager thus triggering timers and then emit current watermark. However, timer triggering is not synchronous in PyFlink (processTimer only put data into beam buffer), and when remote bundle is closed and output records produced by on_timer function finally arrive at Java side, they are already behind the triggering watermark.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)