You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by yi...@apache.org on 2022/09/07 00:34:03 UTC

[hudi] branch asf-site updated: [HUDI-4635] Update 2022 H2 roadmap (#6613)

This is an automated email from the ASF dual-hosted git repository.

yihua pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/hudi.git


The following commit(s) were added to refs/heads/asf-site by this push:
     new 195587d06b [HUDI-4635] Update 2022 H2 roadmap (#6613)
195587d06b is described below

commit 195587d06ba7aa2ffc421eb6c6eb426b4f02a63f
Author: Y Ethan Guo <et...@gmail.com>
AuthorDate: Tue Sep 6 17:33:58 2022 -0700

    [HUDI-4635] Update 2022 H2 roadmap (#6613)
---
 website/src/pages/roadmap.md | 102 +++++++++++++++++++++----------------------
 1 file changed, 49 insertions(+), 53 deletions(-)

diff --git a/website/src/pages/roadmap.md b/website/src/pages/roadmap.md
index 60ddd116ba..07d1b6f549 100644
--- a/website/src/pages/roadmap.md
+++ b/website/src/pages/roadmap.md
@@ -4,69 +4,65 @@ last_modified_at: 2019-12-30T15:59:57-04:00
 ---
 # Roadmap
 
-Hudi community strives to deliver major releases every 2-3 months, while offering minor releases every month!
+Hudi community strives to deliver major releases every 3-4 months, while offering minor releases every other month!
 This page captures the forward-looking roadmap of ongoing & upcoming projects and when they are expected to land, broken
 down by areas on our [stack](blog/2021/07/21/streaming-data-lake-platform/#hudi-stack).
 
-## H1 2022 Releases
+## H2 2022 Releases
 
-Next major release : [0.11.0](https://issues.apache.org/jira/projects/HUDI/versions/12350673) (Apr 2022)
+Next major release : [0.13.0](https://issues.apache.org/jira/projects/HUDI/versions/12352101) (Mid Q4 2022)
 
-|Release|Timeline|
-|------------|--------|
-|[0.10.1](https://issues.apache.org/jira/projects/HUDI/versions/12351135)|Jan 2022|
-|[0.11.0](https://issues.apache.org/jira/projects/HUDI/versions/12350673)|Apr 2022|
-|[0.12.0](https://issues.apache.org/jira/projects/HUDI/versions/12351209)|Jun 2022|
-|[1.0.0](https://issues.apache.org/jira/projects/HUDI/versions/12351210)|Summer 2022|
+| Release                                                              | Timeline       |
+|----------------------------------------------------------------------|----------------|
+| [0.12.1](https://issues.apache.org/jira/projects/HUDI/versions/12352182) | Early Q4 2022  |
+| [0.13.0](https://issues.apache.org/jira/projects/HUDI/versions/12352101) | Mid Q4 2022    |
+| [0.13.1](https://issues.apache.org/jira/projects/HUDI/versions/12352250) | TBD            |
 
-## Transactions/Database Layer
+## Transactional Database Layer
 
-|Feature|Target Release|Tracking|
-|------------|--------|-----------|
-|Space-filling curves hardening & perf improvements |0.11|[HUDI-2100](https://issues.apache.org/jira/browse/HUDI-2100)|
-|Metadata table update via multi-table transactions, turned on by default |0.11|[HUDI-1292](https://issues.apache.org/jira/browse/HUDI-1292)|
-|Metadata Index, as a bloom index alternative, fetching col_stats and bloom_filters from metadata table, improving upsert performance. |0.11|[HUDI-1822](https://issues.apache.org/jira/browse/HUDI-1822), [RFC-37](https://github.com/apache/hudi/pull/3989)|
-|Support for Encryption |0.11|[HUDI-2370](https://issues.apache.org/jira/browse/HUDI-2370)|
-|Schema-on-read for non-backwards compatible schema evolution |0.11|[HUDI-2429](https://issues.apache.org/jira/browse/HUDI-2429)|
-|Improvements to merge-on-read log merging/reading with streaming semantics |0.11|[HUDI-3081](https://issues.apache.org/jira/browse/HUDI-3081)|
-|Indexed columns support & elimination of partitioning |0.11|[HUDI-512](https://issues.apache.org/jira/browse/HUDI-512)|
-|Record-level index to speed up uuid based upserts/deletes |0.12|[HUDI-53](https://issues.apache.org/jira/browse/HUDI-53)|
-|Eager conflict detection for Optimistic Concurrency Control |0.12|[HUDI-1575](https://issues.apache.org/jira/browse/HUDI-1575)|
-|Indexed timeline and infinite retention of versions |0.12|RFC coming soon|
-|Improvements to streaming read and full CDC data model support |0.12| [HUDI-2749](https://issues.apache.org/jira/browse/HUDI-2749), RFC coming soon|
-|Consistent hashing based file distribution over storage to overcome throttling issues for very large tables |0.12|RFC published soon|
-|Lock free concurrency control |0.12 -> 1.0.0|[HUDI-3187](https://issues.apache.org/jira/browse/HUDI-3187)|
-|Non-blocking/Lock-free updates during clustering |0.12 -> 1.0.0|[HUDI-1042](https://issues.apache.org/jira/browse/HUDI-1042)|
-|Time Travel updates, deletes |0.12 -> 1.0.0 ||
-|General purpose support for multi-table transactions |0.12 -> 1.0.0||
+| Feature                                                                                | Target Release | Tracking                                                                                                                                                                                                                                                 |
+|----------------------------------------------------------------------------------------|----------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| Metaserver for all metadata                                                            | 0.13.0         | [Strawman design](https://docs.google.com/presentation/d/1QBgLw11TM2Qf1KUESofGrQDb63EuggNCpPaxc82Kldo/edit#slide=id.ged084e5bf6_0_278), [RFC-36](https://github.com/apache/hudi/pull/4718), [HUDI-3345](https://issues.apache.org/jira/browse/HUDI-3345) |
+| Supervised table service planning and execution                                        | 0.13.0         | [RFC-43](https://github.com/apache/hudi/pull/4309), [HUDI-4147](https://issues.apache.org/jira/browse/HUDI-4147)                                                                                                                                         |
+| Support of Change Data Capture (CDC) with Hudi change logs                             | 0.13.0         | [RFC-51](https://github.com/apache/hudi/blob/master/rfc/rfc-51/rfc-51.md), [HUDI-3478](https://issues.apache.org/jira/browse/HUDI-3478)                                                                                                                  |
+| Record-level index to speed up UUID-based upserts and deletes                          | 0.13.0         | [RFC-08](https://cwiki.apache.org/confluence/display/HUDI/RFC-08++Record+level+indexing+mechanisms+for+Hudi+datasets), [HUDI-53](https://issues.apache.org/jira/browse/HUDI-53)                                                                          |
+| Consistent hashing index for dynamic buckets                                           | 0.13.0         | [RFC-42](https://github.com/apache/hudi/blob/master/rfc/rfc-42/rfc-42.md), [HUDI-3000](https://issues.apache.org/jira/browse/HUDI-3000)                                                                                                                  |
+| Secondary index to improve query performance                                           | 0.13.0         | [RFC-52](https://github.com/apache/hudi/pull/5370), [HUDI-3907](https://issues.apache.org/jira/browse/HUDI-3907)                                                                                                                                         |
+| Reducing write amplification with Log Compaction in MOR                                | 0.13.0         | [RFC-48](https://github.com/apache/hudi/pull/5041), [HUDI-3580](https://issues.apache.org/jira/browse/HUDI-3580)                                                                                                                                         |
+| Eager conflict detection for Optimistic Concurrency Control                            | 0.13.0         | [RFC-56](https://github.com/apache/hudi/pull/6003), [HUDI-1575](https://issues.apache.org/jira/browse/HUDI-1575)                                                                                                                                         |
+| Efficient bootstrap and migration of existing non-Hudi dataset                         | 0.13.0         | [HUDI-1265](https://issues.apache.org/jira/browse/HUDI-1265)                                                                                                                                                                                             |
+| Eliminating physical partitioning with efficient logical partitioning and file pruning | 0.13.0         | [HUDI-512](https://issues.apache.org/jira/browse/HUDI-512)                                                                                                                                                                                               |
+| Lock-Free message queue to improve writing efficiency                                  | 0.13.0         | [RFC-53](https://github.com/apache/hudi/blob/master/rfc/rfc-53/rfc-53.md), [HUDI-3963](https://issues.apache.org/jira/browse/HUDI-3963)                                                                                                                  |
+| Lock free concurrency control                                                          | 1.0.0 onward   | [HUDI-3187](https://issues.apache.org/jira/browse/HUDI-3187)                                                                                                                                                                                             |
+| Non-blocking/Lock-free updates during clustering                                       | 1.0.0 onward   | [HUDI-1042](https://issues.apache.org/jira/browse/HUDI-1042)                                                                                                                                                                                             |
+| Time Travel updates, deletes                                                           | 1.0.0 onward   ||
+| General purpose support for multi-table transactions                                   | 1.0.0 onward   ||
+
+## Programming APIs
+
+|Feature| Target Release |Tracking|
+|------------|----------------|-----------|
+| Redesign and optimization of record payload abstraction | 0.13.0         | [RFC-46](https://github.com/apache/hudi/blob/master/rfc/rfc-46/rfc-46.md), [HUDI-3217](https://issues.apache.org/jira/browse/HUDI-3217) |
+| Optimized storage layout for cloud object stores | 0.13.0         | [RFC-60](https://github.com/apache/hudi/pull/5113), [HUDI-3625](https://issues.apache.org/jira/browse/HUDI-3625) |
+| Support of verification with multiple event_time fields | 0.13.0         | [RFC-59](https://github.com/apache/hudi/pull/6382), [HUDI-4569](https://issues.apache.org/jira/browse/HUDI-4569) |
 
 ## Execution Engine Integration
 
-|Feature|Target Release|Tracking|
-|------------|--------|-----------|
-|Spark SQL DML fixes & enhancements |0.11|[HUDI-1658](https://issues.apache.org/jira/browse/HUDI-1658)|
-|Data-skipping for Hive and Spark based on col_stats from metadata table  |0.11|[HUDI-1296](https://issues.apache.org/jira/browse/HUDI-1296), [RFC-27](https://github.com/apache/hudi/pull/4280)|
-|Non-keyed tables with updates and deletes |0.11|[HUDI-2968](https://issues.apache.org/jira/browse/HUDI-2968)|
-|Trino Connector for Hudi, with read/query support  |0.12|[HUDI-2687](https://issues.apache.org/jira/browse/HUDI-2687), [RFC-38](https://github.com/apache/hudi/pull/3964)|
-|Spark Datasource V2|0.12|[HUDI-1297](https://issues.apache.org/jira/browse/HUDI-1297) ,[HUDI-2531](https://issues.apache.org/jira/browse/HUDI-2531)|
-|Complete ORC Support across query engines |0.12|[HUDI-57](https://issues.apache.org/jira/browse/HUDI-57)|
-|Presto Connector for Hudi, with read/query support |0.12|[PRESTO-17006](https://github.com/prestodb/presto/issues/17006)|
-|Multi-Modal indexing full integration across Presto/Trino/Spark queries |0.12 -> 1.0.0|[HUDI-1822](https://issues.apache.org/jira/browse/HUDI-1822)|
-|Materialized Views with incremental updates using Flink |1.0.0||
-|SQL DML support for Presto/Trino connectors (could be accelerated based on community feedback) |1.0.0||
-|Explore other execution engines/runtimes (Ray, native Rust, Python) |1.0.0||
+| Feature                                                                                        | Target Release | Tracking                                                                                                        |
+|------------------------------------------------------------------------------------------------|--------------|-----------------------------------------------------------------------------------------------------------------|
+| Spark datasource V2 read                                                                       | 0.13.0       | [HUDI-4449](https://issues.apache.org/jira/browse/HUDI-4449)                                                    |
+| Integrate column stats index with all query engines                                            | 0.13.0       | [RFC-58](https://github.com/apache/hudi/pull/6345), [HUDI-4552](https://issues.apache.org/jira/browse/HUDI-4552) |
+| Upgrade to Spark 3 as the default profile                                                      | 0.13.0       | [HUDI-3431](https://issues.apache.org/jira/browse/HUDI-3431)                                                    |
+| Materialized Views with incremental updates using Flink                                        | 1.0.0 onward ||
+| SQL DML support for Presto/Trino connectors (could be accelerated based on community feedback) | 1.0.0 onward ||
+| Explore other execution engines/runtimes (Ray, native Rust, Python)                            | 1.0.0 onward ||
 
 ## Platform Services
 
-|Feature|Target Release|Tracking|
-|------------|--------|-----------|
-|Native support for AWS Glue Metastore   |0.11|[HUDI-2757](https://issues.apache.org/jira/browse/HUDI-2757)|
-|BigQuery and Snowflake external table integration   |0.12|[RFC-34](https://github.com/apache/hudi/pull/4503)|
-|JDBC Incremental Source GA   |0.12|[HUDI-1859](https://issues.apache.org/jira/browse/HUDI-1859)|
-|Mutable, CDC Stream support for Kafka Connect Sink   |0.12|[HUDI-2324](https://issues.apache.org/jira/browse/HUDI-2324)|
-|Airbyte integration   |0.12|RFC coming soon|
-|Apache Pulsar integration for Delta Streamer (blocked on upstream)   |0.12|[HUDI-246](https://issues.apache.org/jira/browse/HUDI-246)|
-|Kinesis deltastreamer source, with DynamoDB CDC   |0.12|[HUDI-1386](https://issues.apache.org/jira/browse/HUDI-1386), [HUDI-310](https://issues.apache.org/jira/browse/HUDI-310)|
-|Support for reliable, event based ingestion from cloud stores - GCS, Azure and the others   |1.0.0|[HUDI-1896](https://issues.apache.org/jira/browse/HUDI-1896)|
-|Hudi Timeline Metaserver for locks, column status and table listings (could be accelerated based on community feedback)   |1.0.0|[Strawman design](https://docs.google.com/presentation/d/1QBgLw11TM2Qf1KUESofGrQDb63EuggNCpPaxc82Kldo/edit#slide=id.gf7e0551254_0_0) |
-|Mutable, Transactional caching for Hudi Tables   (could be accelerated based on community feedback) |1.0.0|[Strawman design](https://docs.google.com/presentation/d/1QBgLw11TM2Qf1KUESofGrQDb63EuggNCpPaxc82Kldo/edit#slide=id.gf7e0551254_0_5)|
+| Feature                                                                                             | Target Release | Tracking                                                                                                                                |
+|-----------------------------------------------------------------------------------------------------|--------------|-----------------------------------------------------------------------------------------------------------------------------------------|
+| Protobuf source for DeltaStreamer                                                                   | 0.13.0       | [RFC-57](https://github.com/apache/hudi/blob/master/rfc/rfc-57/rfc-57.md), [HUDI-4399](https://issues.apache.org/jira/browse/HUDI-4399) |
+| Hudi integration with Snowflake                                                                     | 0.13.0       | [RFC-41](https://github.com/apache/hudi/pull/4074), [HUDI-2832](https://issues.apache.org/jira/browse/HUDI-2832)                        |
+| Improving Hudi CLI features and usability                                                           | 0.13.0       | [HUDI-1388](https://issues.apache.org/jira/browse/HUDI-1388)                                                                            |
+| Support for reliable, event based ingestion from cloud stores - GCS, Azure and the others           | 1.0.0 onward | [HUDI-1896](https://issues.apache.org/jira/browse/HUDI-1896)                                                                            |
+| Mutable, Transactional caching for Hudi Tables   (could be accelerated based on community feedback) | 1.0.0 onward | [Strawman design](https://docs.google.com/presentation/d/1QBgLw11TM2Qf1KUESofGrQDb63EuggNCpPaxc82Kldo/edit#slide=id.gf7e0551254_0_5)    |
\ No newline at end of file