You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Yubin Li (Jira)" <ji...@apache.org> on 2022/04/06 16:51:00 UTC
[jira] [Updated] (FLINK-27100) Support parquet format in FileStore
[ https://issues.apache.org/jira/browse/FLINK-27100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yubin Li updated FLINK-27100:
-----------------------------
Description:
Apache Parquet is a very popular columnar file format, used in many data analysis engines like Hive/Impala/Spark/Flink. we could use simple command lines like parquet-tools to view metadata and data easily instead of using complex java code.
now flink-table-store only support ORC, but there are massive business data stored as Parquet format, developers/analysisers are very familliar with it, and Parquet has better support for impala engine.
maybe it's a good addition to make Parquet usable.
was:
Apache Parquet is a very popular columnar file format, used in many data analysis engines like Hive/Impala/Spark/Flink. we could use simple command lines like parquet-tools to view metadata and data easily instead of using complex java code.
now flink-table-store only support ORC, but there are massive business data stored as parquet format, developers/analysisers are very familliar with it, maybe it's a good addition to make Parquet usable.
> Support parquet format in FileStore
> -----------------------------------
>
> Key: FLINK-27100
> URL: https://issues.apache.org/jira/browse/FLINK-27100
> Project: Flink
> Issue Type: New Feature
> Components: Table Store
> Reporter: Yubin Li
> Priority: Major
>
> Apache Parquet is a very popular columnar file format, used in many data analysis engines like Hive/Impala/Spark/Flink. we could use simple command lines like parquet-tools to view metadata and data easily instead of using complex java code.
> now flink-table-store only support ORC, but there are massive business data stored as Parquet format, developers/analysisers are very familliar with it, and Parquet has better support for impala engine.
> maybe it's a good addition to make Parquet usable.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)