You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Yuwei Xiao (Jira)" <ji...@apache.org> on 2022/07/11 04:00:00 UTC
[jira] [Created] (HUDI-4377) Support different split criteria for consistent hashing index resizing
Yuwei Xiao created HUDI-4377:
--------------------------------
Summary: Support different split criteria for consistent hashing index resizing
Key: HUDI-4377
URL: https://issues.apache.org/jira/browse/HUDI-4377
Project: Apache Hudi
Issue Type: New Feature
Reporter: Yuwei Xiao
Currently, we split at the middle of hash range when splitting hash buckets. This simple strategy may produce un-even children buckets when the data distribution is un-balanced.
A better strategy is to choose a split point which split bucket into a approximated even children buckets.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)