You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@flink.apache.org by "Robert Metzger (Jira)" <ji...@apache.org> on 2020/10/20 16:55:00 UTC

[jira] [Updated] (FLINK-19595) Flink SQL support S3 select

     [ https://issues.apache.org/jira/browse/FLINK-19595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Metzger updated FLINK-19595:
-----------------------------------
    Component/s: Table SQL / Ecosystem
                 FileSystems

> Flink SQL support S3 select
> ---------------------------
>
>                 Key: FLINK-19595
>                 URL: https://issues.apache.org/jira/browse/FLINK-19595
>             Project: Flink
>          Issue Type: Improvement
>          Components: FileSystems, Table SQL / Ecosystem
>            Reporter: liuxiaolong
>            Priority: Major
>
> Flink is based on S3AInputStream.java to select datas stored in Tencent COS, it will call the getObject function of AmazonS3Client.java. 
> Now, Tencent COS  have already support to pushdown the CSV and Parquert file format.
> In these cases, using getObject to select datas will wastes a lots of bandwidth.
> So, I think Flink SQL should support S3 Select, to reduce the waste of bandwidth.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)