You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "nadine (Jira)" <ji...@apache.org> on 2023/01/05 20:31:00 UTC
[jira] [Commented] (HUDI-5508) Revamp hudi homepage website

    [ https://issues.apache.org/jira/browse/HUDI-5508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17655126#comment-17655126 ] 

nadine commented on HUDI-5508:
------------------------------

# 
h3. Mutability support for all data lake workloads

 

Quickly update & delete data with Hudi’s fast, pluggable indexing. This includes streaming workloads, with full support for out-of-order data, bursty traffic & data deduplication.

[[https://hudi.apache.org/docs/next/indexing/]|https://hudi.apache.org/docs/next/indexing/]

 
 * *Available only on apache hudi* 

 
 # 
h3. Improved efficiency by incrementally processing new data

Replace old-school batch pipelines with incremental streaming on your data lake. Experience faster ingestion and lower processing times for analytical workloads. 

[[https://hudi.apache.org/blog/2020/08/18/hudi-incremental-processing-on-data-lakes/]|https://hudi.apache.org/blog/2020/08/18/hudi-incremental-processing-on-data-lakes/]




 * *Available only on apache hudi* 

 
 # 
h3. ACID Transactional guarantees to your data lake

Bring transactional guarantees to your data lake, with consistent, atomic writes and concurrency controls tailored for longer-running lake transactions.[ [https://hudi.apache.org/docs/use_cases#acid-transactions]|https://hudi.apache.org/docs/use_cases#acid-transactions]
 # 
h3. Unlock historical data with time travel

Query historical data with the ability to roll back to a table version; debug data versions to understand what changed over time; audit data changes by viewing the commit history.

[[https://hudi.apache.org/docs/next/use_cases#time-travel]]

[Use Cases | Apache Hudi|https://hudi.apache.org/docs/next/use_cases#time-travel]
 # 
h3. Interoperable multi-cloud ecosystem support

Extensive ecosystem support with plug-and-play options for popular data sources & query engines. Build future-proof architectures interoperable with your vendor of choice.

[[https://hudi.apache.org/docs/next/cloud]]

 
 # 
h3. Comprehensive table services for high-performance analytics

Fully automated table services that continuously schedule & orchestrate clustering, compaction, cleaning, file sizing & indexing to ensure tables are always ready.

[[https://hudi.apache.org/blog/2021/07/21/streaming-data-lake-platform/#table-services]]

 
 * *Available only on apache hudi* 

 
 # 
h3. A rich platform to build your lakehouse faster

Effortlessly build your lakehouse with built-in tools for auto ingestion from services like Debezium and Kafka and auto catalog sync for easy discoverability & more.




 * *Available only on apache hudi*

 * [[https://hudi.apache.org/blog/2022/01/14/change-data-capture-with-debezium-and-apache-hudi/]]

 
 # 
h3. Query acceleration through multi-modal indexes.

Experience faster write transactions on huge/wide tables & faster query performance with first-of-its kind multi-modal indexing subsystem. 

[[Multi-Modal Index for the Lakehouse in Apache Hudi|https://hudi.apache.org/blog/2022/05/17/Introducing-Multi-Modal-Index-for-the-Lakehouse-in-Apache-Hudi]]

 
 * *Available only on apache hudi* 

 
 # 
h3. Resilient Pipelines with schema evolution & enforcement 

Easily change the current schema of a Hudi table to adapt to the data that is changing over time and ensure pipeline resilience by failing fast and avoiding data corruption. 

[[https://hudi.apache.org/docs/next/schema_evolution/]|https://hudi.apache.org/docs/next/schema_evolution/]\

 

—

Reference: [https://hudi.apache.org/]






—----

*NEW: 01-04-23*

 

*WHY HUDI*

Take advantage of Hudi’s platform with rich services and tools to make your data lake actionable for applications like personalization, machine learning, customer 360 and more!

 
 * *Trusted Platform*

 # Battle tested and proven in production in some of the largest data lakes on the planet.

 * *Open source*

 # Hudi is a thriving & growing community that is built with contributions from people around the globe.

 * *Derived tables (2 options?)*

 # Seamlessly create and manage SQL tables on your data lake to build multi-stage incremental pipelines 

 * *Data streams*

 # Take advantage of built-in CDC sources and tools for streaming ingestion

|*Join community.* Get technical help, influence the product roadmap & see what’s new with Hudi!|
| |

> Revamp hudi homepage website
> ----------------------------
>
>                 Key: HUDI-5508
>                 URL: https://issues.apache.org/jira/browse/HUDI-5508
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: docs
>            Reporter: nadine
>            Assignee: nadine
>            Priority: Minor
>              Labels: website
>
> 1) work on feature blurbs to show how hudi adds value to someone's data stack
> 2) help improve hudi's overview architectural diagram



--
This message was sent by Atlassian Jira
(v8.20.10#820010)