You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Brandon Scheller (Jira)" <ji...@apache.org> on 2020/01/11 00:52:00 UTC

[jira] [Created] (HUDI-520) Decide on keyGenerator strategy for handling null/empty recordkeys

Brandon Scheller created HUDI-520:
-------------------------------------

             Summary: Decide on keyGenerator strategy for handling null/empty recordkeys 
                 Key: HUDI-520
                 URL: https://issues.apache.org/jira/browse/HUDI-520
             Project: Apache Hudi (incubating)
          Issue Type: Bug
            Reporter: Brandon Scheller


Currently key-generator implementations write out "__null__" for null values and "__empty__" for empty in order to provide a distinction between the two. This can add extra overhead to large datalakes that might not use this distinction.

This Jira is to decide on a consistent strategy for handling null/empty record keys in key generators.

 

The current strategy can be seen within ComplexKeyGenerator



--
This message was sent by Atlassian Jira
(v8.3.4#803005)