You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Brandon Scheller (Jira)" <ji...@apache.org> on 2020/01/11 00:52:00 UTC
[jira] [Created] (HUDI-520) Decide on keyGenerator strategy for
handling null/empty recordkeys
Brandon Scheller created HUDI-520:
-------------------------------------
Summary: Decide on keyGenerator strategy for handling null/empty recordkeys
Key: HUDI-520
URL: https://issues.apache.org/jira/browse/HUDI-520
Project: Apache Hudi (incubating)
Issue Type: Bug
Reporter: Brandon Scheller
Currently key-generator implementations write out "__null__" for null values and "__empty__" for empty in order to provide a distinction between the two. This can add extra overhead to large datalakes that might not use this distinction.
This Jira is to decide on a consistent strategy for handling null/empty record keys in key generators.
The current strategy can be seen within ComplexKeyGenerator
--
This message was sent by Atlassian Jira
(v8.3.4#803005)