Posted to dev@flink.apache.org by "Huang Xingbo (Jira)" <ji...@apache.org> on 2020/09/09 07:11:00 UTC

[jira] [Created] (FLINK-19169) Support Pandas UDAF in PyFlink (FLIP-137)

Huang Xingbo created FLINK-19169:
------------------------------------

             Summary: Support Pandas UDAF in PyFlink (FLIP-137)
                 Key: FLINK-19169
                 URL: https://issues.apache.org/jira/browse/FLINK-19169
             Project: Flink
          Issue Type: Improvement
          Components: API / Python
            Reporter: Huang Xingbo
             Fix For: 1.12.0


Pandas UDFs have been supported since Flink 1.11 ([FLIP-97|https://cwiki.apache.org/confluence/display/FLINK/FLIP-97%3A+Support+Scalar+Vectorized+Python+UDF+in+PyFlink]). They avoid the high serialization/deserialization overhead of regular Python UDFs and make it convenient to use popular Python libraries such as Pandas and NumPy. Given these advantages, we want to support Pandas UDAFs (user-defined aggregate functions) to extend the usage of Pandas UDFs.
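
To illustrate the difference in shape between a scalar Pandas UDF and a Pandas UDAF, here is a minimal sketch in plain pandas. It does not use any PyFlink API, and the function names (scalar_pandas_udf, pandas_udaf_mean) and the sample data are hypothetical and chosen only for illustration. The assumption is that a scalar Pandas UDF maps a batch of rows (pandas.Series) to a Series of the same length, while a Pandas UDAF reduces a batch to a single value.

```python
import pandas as pd


def scalar_pandas_udf(a: pd.Series, b: pd.Series) -> pd.Series:
    # Scalar vectorized UDF shape: one output value per input row,
    # computed on the whole batch at once.
    return a + b


def pandas_udaf_mean(v: pd.Series) -> float:
    # Aggregate shape: a whole batch (Series) is reduced to one value.
    return float(v.mean())


# Hypothetical sample batch, standing in for rows handed to Python.
batch = pd.DataFrame(
    {"key": ["a", "a", "b"], "x": [1.0, 3.0, 10.0], "y": [1.0, 1.0, 1.0]}
)

# Row-wise result: same length as the input batch.
row_wise = scalar_pandas_udf(batch["x"], batch["y"])

# Aggregated result: one value per group key.
per_key = batch.groupby("key")["x"].agg(pandas_udaf_mean)
```

In this sketch the grouping is done by pandas itself; in PyFlink the grouping and batching would be handled by the framework, and only the Series-in, scalar-out function would be user code.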



--
This message was sent by Atlassian Jira
(v8.3.4#803005)