Posted to issues@flink.apache.org by "Jingsong Lee (Jira)" <ji...@apache.org> on 2020/07/29 07:55:00 UTC
[jira] [Assigned] (FLINK-18508) Dynamic source supports statistics and parallelism report
[ https://issues.apache.org/jira/browse/FLINK-18508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jingsong Lee reassigned FLINK-18508:
------------------------------------
Assignee: Jingsong Lee
> Dynamic source supports statistics and parallelism report
> ---------------------------------------------------------
>
> Key: FLINK-18508
> URL: https://issues.apache.org/jira/browse/FLINK-18508
> Project: Flink
> Issue Type: New Feature
> Components: Table SQL / API
> Reporter: Jingsong Lee
> Assignee: Jingsong Lee
> Priority: Major
> Fix For: 1.12.0
>
>
> Add SupportsStatisticsReport and SupportsParallelismReport to dynamic sources, so that a source can return information that helps the table optimizer.
> This information can be more accurate when it comes from the source rather than from the catalog:
> * First, the information is computed based on real data. For the Iceberg / filesystem connectors, it can be calculated from the actual files. Although it depends on physical/runtime state, it is real and exact.
> * Second, for the Iceberg / filesystem connectors, for example, the statistics can change substantially after filter and partition pushdown, because many files may have been filtered out.
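
The two points above can be illustrated with a minimal, self-contained Java sketch. Only the interface names SupportsStatisticsReport and SupportsParallelismReport come from the issue. The method signatures, the FileSource and DataFile classes, and the rule used to pick a parallelism are assumptions made for illustration, and none of this is Flink's actual API. The sketch shows a file-backed source that derives its row count from the files it will actually read, so the reported numbers change once a partition has been pushed down.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.OptionalInt;
import java.util.OptionalLong;

class SourceReportSketch {

    // Hypothetical signature: the issue names this interface but not its methods.
    interface SupportsStatisticsReport {
        OptionalLong reportRowCount();
    }

    // Hypothetical signature: the issue names this interface but not its methods.
    interface SupportsParallelismReport {
        OptionalInt reportParallelism();
    }

    // Hypothetical description of one data file and the partition it belongs to.
    static final class DataFile {
        final String partition;
        final long rowCount;

        DataFile(String partition, long rowCount) {
            this.partition = partition;
            this.rowCount = rowCount;
        }
    }

    // Hypothetical file-backed source that reports numbers computed from its files.
    static final class FileSource implements SupportsStatisticsReport, SupportsParallelismReport {
        private final List<DataFile> files;
        private final int maxParallelism;

        FileSource(List<DataFile> files, int maxParallelism) {
            this.files = new ArrayList<>(files);
            this.maxParallelism = maxParallelism;
        }

        // Stand-in for partition pushdown: keep only the files of one partition.
        FileSource withPartition(String partition) {
            List<DataFile> remaining = new ArrayList<>();
            for (DataFile file : files) {
                if (file.partition.equals(partition)) {
                    remaining.add(file);
                }
            }
            return new FileSource(remaining, maxParallelism);
        }

        // Row count summed over the files that remain after pushdown.
        @Override
        public OptionalLong reportRowCount() {
            long total = 0;
            for (DataFile file : files) {
                total += file.rowCount;
            }
            return OptionalLong.of(total);
        }

        // Assumed rule: one task per file, capped at maxParallelism; no hint if nothing is read.
        @Override
        public OptionalInt reportParallelism() {
            if (files.isEmpty()) {
                return OptionalInt.empty();
            }
            return OptionalInt.of(Math.min(files.size(), maxParallelism));
        }
    }

    public static void main(String[] args) {
        FileSource source = new FileSource(Arrays.asList(
                new DataFile("dt=2020-07-28", 100L),
                new DataFile("dt=2020-07-28", 250L),
                new DataFile("dt=2020-07-29", 50L)), 4);
        FileSource pruned = source.withPartition("dt=2020-07-29");

        System.out.println("before pushdown: rows=" + source.reportRowCount().getAsLong()
                + ", parallelism=" + source.reportParallelism().getAsInt());
        System.out.println("after pushdown: rows=" + pruned.reportRowCount().getAsLong()
                + ", parallelism=" + pruned.reportParallelism().getAsInt());
    }
}
```

In this sketch the optimizer would query the source only after pushdown has been applied, which is why the reports live on the source object and not in catalog metadata. Returning an empty Optional is one possible way to signal that the source has no estimate to offer.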
--
This message was sent by Atlassian Jira
(v8.3.4#803005)