You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/04/28 22:20:01 UTC

[GitHub] [hudi] bhasudha commented on a diff in pull request #5449: [HUDI-3911][DOCS][WIP] Add async indexing doc

bhasudha commented on code in PR #5449:
URL: https://github.com/apache/hudi/pull/5449#discussion_r861350782


##########
website/blog/2022-04-27-async-indexing.md:
##########
@@ -0,0 +1,213 @@
+---
+title: "Asynchronous Indexing using Hudi"
+excerpt: "How to setup Hudi for asynchronous indexing"
+author: codope 
+category: blog
+---
+
+In its rich set of asynchronous table services, Hudi has added yet another asynchronous service for indexing. It allows users to create different kinds of indexes (files, bloom
+filters, and column stats) on the Hudi table without blocking ingestion.
+
+<!--truncate-->
+
+## Introduction
+
+The metadata table in Hudi is an internal Merge-on-Read (MOR) table that has a single partition called `files` which stores the data partitions to files index that is used for
+efficient file listing. In the release version 0.11.0, we added support for multi-modal indexes which include two other types of indexes like `COLUMN_STATS` and `BLOOM_FILTERS`.
+Multi-modal indexes will greatly improve the record lookup time. Head over to that [blog](https://todo.add.link/) to learn more about multi-modal indexes. In this blog, we discuss how these indexes

Review Comment:
   got it. Thanks for the context.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org