You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Dong Lin (Jira)" <ji...@apache.org> on 2023/03/28 01:18:00 UTC
[jira] [Commented] (FLINK-31240) Reduce the overhead of conversion between DataStream and Table
[ https://issues.apache.org/jira/browse/FLINK-31240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705739#comment-17705739 ]
Dong Lin commented on FLINK-31240:
----------------------------------
[~yunfengzhou] Can you explain the impact of this performance optimizing in the JIRA description?
> Reduce the overhead of conversion between DataStream and Table
> --------------------------------------------------------------
>
> Key: FLINK-31240
> URL: https://issues.apache.org/jira/browse/FLINK-31240
> Project: Flink
> Issue Type: Improvement
> Components: Table SQL / API
> Reporter: Jiang Xin
> Assignee: Yunfeng Zhou
> Priority: Major
> Labels: pull-request-available
>
> In some cases, users may need to convert the underlying DataStream to Table and then convert it back to DataStream(e.g. some Flink ML libraries accept a Table as input and convert it to DataStream for calculation.). This would cause unnecessary overhead because of data conversion between the internal data type and the external data type.
> We can reduce the overhead by checking if there are paired `fromDataStream`/`toDataStream` function call without any transformation, if so using the source datastream directly.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)