You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by bh...@apache.org on 2022/07/21 16:46:18 UTC

[hudi] branch asf-site updated: [DOCS] Redshift Spectrum update for old Hudi versions. (#6153)

This is an automated email from the ASF dual-hosted git repository.

bhavanisudha pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/hudi.git


The following commit(s) were added to refs/heads/asf-site by this push:
     new c215bbe9f0 [DOCS] Redshift Spectrum update for old Hudi versions. (#6153)
c215bbe9f0 is described below

commit c215bbe9f0686fba2a589855262cc72f6eae45ba
Author: pomaster <ph...@yahoo.com>
AuthorDate: Thu Jul 21 12:46:11 2022 -0400

    [DOCS] Redshift Spectrum update for old Hudi versions. (#6153)
    
    s Please enter the commit message for your changes. Lines starting
---
 website/versioned_docs/version-0.10.0/query_engine_setup.md | 10 ++++++++++
 website/versioned_docs/version-0.10.0/querying_data.md      |  3 +++
 website/versioned_docs/version-0.5.2/querying_data.md       |  9 +++++++++
 website/versioned_docs/version-0.6.0/querying_data.md       |  9 +++++++++
 website/versioned_docs/version-0.7.0/querying_data.md       |  9 +++++++++
 website/versioned_docs/version-0.8.0/querying_data.md       |  9 +++++++++
 website/versioned_docs/version-0.9.0/querying_data.md       |  9 +++++++++
 7 files changed, 58 insertions(+)

diff --git a/website/versioned_docs/version-0.10.0/query_engine_setup.md b/website/versioned_docs/version-0.10.0/query_engine_setup.md
index eccae1b874..6e4f60b496 100644
--- a/website/versioned_docs/version-0.10.0/query_engine_setup.md
+++ b/website/versioned_docs/version-0.10.0/query_engine_setup.md
@@ -71,3 +71,13 @@ In order for Hive to recognize Hudi tables and query correctly,
 In addition to setup above, for beeline cli access, the `hive.input.format` variable needs to be set to the fully qualified path name of the
 inputformat `org.apache.hudi.hadoop.HoodieParquetInputFormat`. For Tez, additionally the `hive.tez.input.format` needs to be set
 to `org.apache.hadoop.hive.ql.io.HiveInputFormat`. Then proceed to query the table like any other Hive table.
+
+## Redshift Spectrum
+Copy on Write Tables in Apache Hudi versions 0.5.2, 0.6.0, 0.7.0, 0.8.0, 0.9.0, and 0.10.0 can be queried via Amazon Redshift Spectrum external tables.
+:::note
+Hudi tables are supported only when AWS Glue Data Catalog is used. It's not supported when you use an Apache Hive metastore as the external catalog.
+:::
+
+Please refer to [Redshift Spectrum Integration with Apache Hudi](https://docs.aws.amazon.com/redshift/latest/dg/c-spectrum-external-tables.html#c-spectrum-column-mapping-hudi)
+for more details.
+
diff --git a/website/versioned_docs/version-0.10.0/querying_data.md b/website/versioned_docs/version-0.10.0/querying_data.md
index c516708e7d..6e1c15aa71 100644
--- a/website/versioned_docs/version-0.10.0/querying_data.md
+++ b/website/versioned_docs/version-0.10.0/querying_data.md
@@ -204,6 +204,8 @@ After Hudi made a new commit, refresh the Impala table to get the latest results
 ```
 REFRESH database.table_name
 ```
+## Redshift Spectrum
+To set up Redshift Spectrum for querying Hudi, see the [Query Engine Setup](/docs/0.10.0/query_engine_setup#redshift-spectrum) page.
 
 ## Support Matrix
 
@@ -220,6 +222,7 @@ Following tables show whether a given query is supported on specific query engin
 |**PrestoDB**|Y|N|
 |**Trino**|Y|N|
 |**Impala**|Y|N|
+|**Redshift Spectrum** |Y|N|
 
 
 Note that `Read Optimized` queries are not applicable for COPY_ON_WRITE tables.
diff --git a/website/versioned_docs/version-0.5.2/querying_data.md b/website/versioned_docs/version-0.5.2/querying_data.md
index feedac14d0..fdf3b67fac 100644
--- a/website/versioned_docs/version-0.5.2/querying_data.md
+++ b/website/versioned_docs/version-0.5.2/querying_data.md
@@ -42,6 +42,7 @@ Following tables show whether a given query is supported on specific query engin
 |**Spark Datasource**|Y|Y|
 |**Presto**|Y|N|
 |**Impala**|Y|N|
+|**Redshift Spectrum**|Y|N|
 
 
 Note that `Read Optimized` queries are not applicable for COPY_ON_WRITE tables.
@@ -199,3 +200,11 @@ After Hudi made a new commit, refresh the Impala table to get the latest results
 ```
 REFRESH database.table_name
 ```
+## Redshift Spectrum
+Copy on Write Tables in Apache Hudi versions 0.5.2, 0.6.0, 0.7.0, 0.8.0, 0.9.0, and 0.10.0 can be queried via Amazon Redshift Spectrum external tables.
+:::note
+Hudi tables are supported only when AWS Glue Data Catalog is used. It's not supported when you use an Apache Hive metastore as the external catalog.
+:::
+
+Please refer to [Redshift Spectrum Integration with Apache Hudi](https://docs.aws.amazon.com/redshift/latest/dg/c-spectrum-external-tables.html#c-spectrum-column-mapping-hudi)
+for more details.
diff --git a/website/versioned_docs/version-0.6.0/querying_data.md b/website/versioned_docs/version-0.6.0/querying_data.md
index c275a7119a..5735713c59 100644
--- a/website/versioned_docs/version-0.6.0/querying_data.md
+++ b/website/versioned_docs/version-0.6.0/querying_data.md
@@ -42,6 +42,7 @@ Following tables show whether a given query is supported on specific query engin
 |**Spark Datasource**|Y|Y|
 |**PrestoDB**|Y|N|
 |**Impala**|Y|N|
+|**Redshift Spectrum**|Y|N|
 
 
 Note that `Read Optimized` queries are not applicable for COPY_ON_WRITE tables.
@@ -219,3 +220,11 @@ After Hudi made a new commit, refresh the Impala table to get the latest results
 ```
 REFRESH database.table_name
 ```
+## Redshift Spectrum
+Copy on Write Tables in Apache Hudi versions 0.5.2, 0.6.0, 0.7.0, 0.8.0, 0.9.0, and 0.10.0 can be queried via Amazon Redshift Spectrum external tables.
+:::note
+Hudi tables are supported only when AWS Glue Data Catalog is used. It's not supported when you use an Apache Hive metastore as the external catalog.
+:::
+
+Please refer to [Redshift Spectrum Integration with Apache Hudi](https://docs.aws.amazon.com/redshift/latest/dg/c-spectrum-external-tables.html#c-spectrum-column-mapping-hudi)
+for more details.
diff --git a/website/versioned_docs/version-0.7.0/querying_data.md b/website/versioned_docs/version-0.7.0/querying_data.md
index e477c86521..f97e0df504 100644
--- a/website/versioned_docs/version-0.7.0/querying_data.md
+++ b/website/versioned_docs/version-0.7.0/querying_data.md
@@ -42,6 +42,7 @@ Following tables show whether a given query is supported on specific query engin
 |**Spark Datasource**|Y|Y|
 |**PrestoDB**|Y|N|
 |**Impala**|Y|N|
+|**Redshift Spectrum**|Y|N|
 
 
 Note that `Read Optimized` queries are not applicable for COPY_ON_WRITE tables.
@@ -219,3 +220,11 @@ After Hudi made a new commit, refresh the Impala table to get the latest results
 ```
 REFRESH database.table_name
 ```
+## Redshift Spectrum
+Copy on Write Tables in Apache Hudi versions 0.5.2, 0.6.0, 0.7.0, 0.8.0, 0.9.0, and 0.10.0 can be queried via Amazon Redshift Spectrum external tables.
+:::note
+Hudi tables are supported only when AWS Glue Data Catalog is used. It's not supported when you use an Apache Hive metastore as the external catalog.
+:::
+
+Please refer to [Redshift Spectrum Integration with Apache Hudi](https://docs.aws.amazon.com/redshift/latest/dg/c-spectrum-external-tables.html#c-spectrum-column-mapping-hudi)
+for more details.
diff --git a/website/versioned_docs/version-0.8.0/querying_data.md b/website/versioned_docs/version-0.8.0/querying_data.md
index 32886eb943..ad929a6150 100644
--- a/website/versioned_docs/version-0.8.0/querying_data.md
+++ b/website/versioned_docs/version-0.8.0/querying_data.md
@@ -43,6 +43,7 @@ Following tables show whether a given query is supported on specific query engin
 |**Flink SQL**|Y|N|
 |**PrestoDB**|Y|N|
 |**Impala**|Y|N|
+|**Redshift Spectrum**|Y|N|
 
 
 Note that `Read Optimized` queries are not applicable for COPY_ON_WRITE tables.
@@ -262,3 +263,11 @@ After Hudi made a new commit, refresh the Impala table to get the latest results
 ```
 REFRESH database.table_name
 ```
+## Redshift Spectrum
+Copy on Write Tables in Apache Hudi versions 0.5.2, 0.6.0, 0.7.0, 0.8.0, 0.9.0, and 0.10.0 can be queried via Amazon Redshift Spectrum external tables.
+:::note
+Hudi tables are supported only when AWS Glue Data Catalog is used. It's not supported when you use an Apache Hive metastore as the external catalog.
+:::
+
+Please refer to [Redshift Spectrum Integration with Apache Hudi](https://docs.aws.amazon.com/redshift/latest/dg/c-spectrum-external-tables.html#c-spectrum-column-mapping-hudi)
+for more details.
diff --git a/website/versioned_docs/version-0.9.0/querying_data.md b/website/versioned_docs/version-0.9.0/querying_data.md
index 179da8c02a..8cb50d210a 100644
--- a/website/versioned_docs/version-0.9.0/querying_data.md
+++ b/website/versioned_docs/version-0.9.0/querying_data.md
@@ -43,6 +43,7 @@ Following tables show whether a given query is supported on specific query engin
 |**PrestoDB**|Y|N|
 |**Trino**|Y|N|
 |**Impala**|Y|N|
+|**Redshift Spectrum**|Y|N|
 
 
 Note that `Read Optimized` queries are not applicable for COPY_ON_WRITE tables.
@@ -275,3 +276,11 @@ After Hudi made a new commit, refresh the Impala table to get the latest results
 ```
 REFRESH database.table_name
 ```
+## Redshift Spectrum
+Copy on Write Tables in Apache Hudi versions 0.5.2, 0.6.0, 0.7.0, 0.8.0, 0.9.0, and 0.10.0 can be queried via Amazon Redshift Spectrum external tables.
+:::note
+Hudi tables are supported only when AWS Glue Data Catalog is used. It's not supported when you use an Apache Hive metastore as the external catalog.
+:::
+
+Please refer to [Redshift Spectrum Integration with Apache Hudi](https://docs.aws.amazon.com/redshift/latest/dg/c-spectrum-external-tables.html#c-spectrum-column-mapping-hudi)
+for more details.