You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hudi.apache.org by leesf <le...@gmail.com> on 2022/07/31 15:18:00 UTC

[ANNOUNCE] Hudi Community Update(2022-07-18 ~ 2022-07-31)

Dear community,

Nice to share Hudi community bi-weekly updates for 2022-07-18 ~ 2022-07-31
with updates on bug fixes.


=======================================
Features

[Core] Add FileBasedLockProvider [1]
[Spark] Allow loading external configs while querying Hudi tables with
Spark [2]
[Spark] Add sync validate procedure [3]
[Spark] Support Hudi with Spark 3.3.0 [4]


[1] https://issues.apache.org/jira/browse/HUDI-4065
[2] https://issues.apache.org/jira/browse/HUDI-3764
[3] https://issues.apache.org/jira/browse/HUDI-3510
[4] https://issues.apache.org/jira/browse/HUDI-4186


=======================================
Bugs

[Spark] Porting Nested Schema Pruning optimization for Hudi's custom
Relations [1]
[Spark] Replacing UDF in Bulk Insert w/ RDD transformation [2]
[Spark] Fix missing bloom filters in metadata table in non-partitioned
table [3]
[Spark] Fix insert into dynamic partition write misalignment [4]
[Spark] Make NONE sort mode as default for bulk insert [5]
[Spark] fix merge into sql data quality in concurrent scene [6]
[Core] Optimize performance of Column Stats Index reading in Data Skipping
[7]
[Spark] Addressing Spark SQL vs Spark DS performance gap [8]



[1] https://issues.apache.org/jira/browse/HUDI-3896
[2] https://issues.apache.org/jira/browse/HUDI-3993
[3] https://issues.apache.org/jira/browse/HUDI-4400
[4] https://issues.apache.org/jira/browse/HUDI-4404
[5] https://issues.apache.org/jira/browse/HUDI-4071
[6] https://issues.apache.org/jira/browse/HUDI-4348
[7] https://issues.apache.org/jira/browse/HUDI-4250
[8] https://issues.apache.org/jira/browse/HUDI-4081


Best,
Leesf