Posted to commits@hudi.apache.org by "lvhu (Jira)" <ji...@apache.org> on 2023/02/16 04:04:00 UTC

[jira] [Created] (HUDI-5810) Add hash partition

lvhu created HUDI-5810:
--------------------------

             Summary: Add hash partition
                 Key: HUDI-5810
                 URL: https://issues.apache.org/jira/browse/HUDI-5810
             Project: Apache Hudi
          Issue Type: New Feature
          Components: spark
            Reporter: lvhu
            Assignee: lvhu


It is often difficult to find an appropriate partition key in existing data. Hash partitioning can solve this problem by partitioning on a hash of one or more chosen fields.

When hash.partition.fields is specified and partition.fields contains _hoodie_hash_partition, a column named _hoodie_hash_partition is added to the table as one of its partition keys.
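
As a rough illustration of the idea (not the proposed Hudi implementation), the derived column could be computed along these lines. The CRC32 hash, the bucket count of 16, and the helper names hash_partition_value and add_hash_partition are all assumptions made for this sketch; the issue does not specify the hash function or how the number of partitions is configured.

```python
import zlib

# Assumption: a fixed bucket count. The issue does not say how this is configured.
NUM_BUCKETS = 16


def hash_partition_value(record, hash_fields, num_buckets=NUM_BUCKETS):
    """Map the values of hash_fields in a record to a bucket id in [0, num_buckets)."""
    # Join the field values with a separator so ("ab", "c") and ("a", "bc") differ.
    key = "\x00".join(str(record[f]) for f in hash_fields)
    # CRC32 is used only because it is deterministic across runs (hypothetical choice).
    return zlib.crc32(key.encode("utf-8")) % num_buckets


def add_hash_partition(record, hash_fields, num_buckets=NUM_BUCKETS):
    """Return a copy of the record with the _hoodie_hash_partition column added."""
    out = dict(record)
    out["_hoodie_hash_partition"] = hash_partition_value(record, hash_fields, num_buckets)
    return out
```

For example, add_hash_partition({"user_id": 42, "name": "a"}, ["user_id"]) returns the same record plus a _hoodie_hash_partition value between 0 and 15, and records with the same user_id always land in the same partition.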

If the query contains predicates on the fields listed in hash.partition.fields, a _hoodie_hash_partition = X predicate is automatically added to the query for partition pruning.
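
The rewrite described above might look roughly like the following sketch. It assumes that only equality predicates covering every hash field can be rewritten, that predicates are represented as a plain column-to-value dict, and that the same CRC32-based bucketing is used at write and query time; the names bucket_of and add_pruning_predicate are hypothetical and not part of Hudi.

```python
import zlib


def bucket_of(values, num_buckets):
    """Deterministically map a list of field values to a bucket id (assumed CRC32 scheme)."""
    key = "\x00".join(str(v) for v in values)
    return zlib.crc32(key.encode("utf-8")) % num_buckets


def add_pruning_predicate(equality_predicates, hash_fields, num_buckets=16):
    """Add _hoodie_hash_partition = X when every hash field has an equality predicate.

    equality_predicates: dict of column name -> literal value taken from the query.
    Returns a new dict; the input is left unchanged.
    """
    rewritten = dict(equality_predicates)
    # Without a value for every hash field the bucket cannot be computed,
    # so the query is left as-is and no partitions are pruned.
    if not all(f in equality_predicates for f in hash_fields):
        return rewritten
    values = [equality_predicates[f] for f in hash_fields]
    rewritten["_hoodie_hash_partition"] = bucket_of(values, num_buckets)
    return rewritten
```

With hash fields ["user_id"], a query filtering on user_id = 42 gains a _hoodie_hash_partition predicate and only one partition needs to be scanned, while a query filtering only on other columns is returned unchanged.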



--
This message was sent by Atlassian Jira
(v8.20.10#820010)