You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Wei Zhong (Jira)" <ji...@apache.org> on 2021/02/26 07:38:00 UTC

[jira] [Created] (FLINK-21509) "w.start" returns "1970-01-01" when used with Pandas UDAF after grouping by slide window with processing time

Wei Zhong created FLINK-21509:
---------------------------------

             Summary: "w.start" returns "1970-01-01" when used with Pandas UDAF after grouping by slide window with processing time
                 Key: FLINK-21509
                 URL: https://issues.apache.org/jira/browse/FLINK-21509
             Project: Flink
          Issue Type: Bug
          Components: API / Python
    Affects Versions: 1.12.1, 1.13.0
            Reporter: Wei Zhong


"w.start" returns "1970-01-01" when used with Pandas UDAF after grouping by slide window with processing time. Reproduce code:
{code:java}
t_env.get_config().get_configuration().set_string("parallelism.default", "1")
from pyflink.table.window import Slide
t_env.register_function("mean_udaf", mean_udaf)

source_table = """
    create table source_table(
        a INT,
        proctime as PROCTIME()
    ) with(
        'connector' = 'datagen',
        'rows-per-second' = '1',
        'fields.a.kind' = 'sequence',
        'fields.a.start' = '1',
        'fields.a.end' = '10'
    )
"""
t_env.execute_sql(source_table)
t = t_env.from_path("source_table")
iterator = t.select("a, proctime") \
    .window(Slide.over("1.seconds").every("1.seconds").on("proctime").alias("w")) \
    .group_by("a, w") \
    .select("mean_udaf(a) as b, w.start").execute().collect()
result = [i for i in iterator]
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)