Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2020/09/18 02:53:42 UTC

[GitHub] [hudi] nsivabalan commented on a change in pull request #2073: [HUDI-769] Added blog for HoodieMultiTableDeltaStreamer

nsivabalan commented on a change in pull request #2073:
URL: https://github.com/apache/hudi/pull/2073#discussion_r490670265



##########
File path: docs/_posts/2020-08-22-ingest-multiple-tables-using-hudi.md
##########
@@ -0,0 +1,104 @@
+---
+title: "Ingest multiple tables using Hudi"
+excerpt: "Ingesting multiple tables using Hudi at a single go is now possible. This blog gives a detailed explanation of how to achieve the same using `HoodieMultiTableDeltaStreamer.java`"
+author: pratyaksh.sharma.hudi
+category: blog
+---
+
+When building a change data capture pipeline for already existing or newly created relational databases, one of the most common problems that one faces is simplifying the onboarding process for multiple tables. Ingesting multiple tables to Hudi dataset at a single go is now possible using `HoodieMultiTableDeltaStreamer` class which is a wrapper on top of the more popular `HoodieDeltaStreamer` class. Currently `HoodieMultiTableDeltaStreamer` supports **COPY_ON_WRITE** storage type only and the ingestion is done in a **sequential** way.

Review comment:
       My 2 cents, but it's your call. Instead of "one of the most common problems that one faces is simplifying the onboarding process for multiple tables", how about "one of the major challenges is the complexity in onboarding multiple tables"?

##########
File path: content/blog/ingest-multiple-tables-using-hudi/index.html
##########
@@ -0,0 +1,233 @@
+<!doctype html>

Review comment:
       Yeah, I don't think you need this file.

##########
File path: docs/_docs/2_2_writing_data.md
##########
@@ -210,6 +210,8 @@ Sample config files for table wise overridden properties can be found under `hud
   --op BULK_INSERT
 ```
 
+For detailed information on how to configure and use `HoodieMultiTableDeltaStreamer`, please refer [blog section](/blog/ingest-multiple-tables-using-hudi).

Review comment:
       Just confirming: do you know whether this link actually works?




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org