You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Raymond Xu (Jira)" <ji...@apache.org> on 2022/03/29 18:02:00 UTC

[jira] [Updated] (HUDI-2832) [Umbrella] [RFC-40] Implement SnowflakeSyncTool to support Hudi to Snowflake Integration

     [ https://issues.apache.org/jira/browse/HUDI-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Raymond Xu updated HUDI-2832:
-----------------------------
    Fix Version/s: 0.12.0
                       (was: 0.11.0)

> [Umbrella] [RFC-40] Implement SnowflakeSyncTool to support Hudi to Snowflake Integration
> ----------------------------------------------------------------------------------------
>
>                 Key: HUDI-2832
>                 URL: https://issues.apache.org/jira/browse/HUDI-2832
>             Project: Apache Hudi
>          Issue Type: Epic
>          Components: Common Core
>            Reporter: Vinoth Govindarajan
>            Assignee: Vinoth Govindarajan
>            Priority: Major
>              Labels: BigQuery, Integration, pull-request-available
>             Fix For: 0.12.0
>
>
> Snowflake is a fully managed service that’s simple to use but can power a near-unlimited number of concurrent workloads. Snowflake is a solution for data warehousing, data lakes, data engineering, data science, data application development, and securely sharing and consuming shared data. Snowflake [doesn’t support|https://docs.snowflake.com/en/sql-reference/sql/alter-file-format.html] Apache Hudi file format yet, but it has support for Parquet, ORC, and Delta file format. This proposal is to implement a SnowflakeSync similar to HiveSync to sync the Hudi table as the Snowflake External Parquet table so that users can query the Hudi tables using Snowflake. Many users have expressed interest in Hudi and other support channels asking to integrate Hudi with Snowflake, this will unlock new use cases for Hudi.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)