You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2022/05/02 20:12:00 UTC
[jira] [Updated] (HUDI-3995) Avoid converting to Avro in standard Key Generators
[ https://issues.apache.org/jira/browse/HUDI-3995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
sivabalan narayanan updated HUDI-3995:
--------------------------------------
Status: Patch Available (was: In Progress)
> Avoid converting to Avro in standard Key Generators
> ---------------------------------------------------
>
> Key: HUDI-3995
> URL: https://issues.apache.org/jira/browse/HUDI-3995
> Project: Apache Hudi
> Issue Type: Improvement
> Components: spark, writer-core
> Reporter: sivabalan narayanan
> Assignee: sivabalan narayanan
> Priority: Blocker
> Labels: pull-request-available
> Fix For: 0.12.0
>
> Original Estimate: 12h
> Remaining Estimate: 12h
>
> While investigating, perf hits in the Bulk Insert a few issues were found:
> # NonPartitionedKeyGenerator does not implement `getRecordKey`, `getParititionKey` for `InternalRow`, leading to invocation of default implementation converting row to Avro.
> # HUDI-3993: Using UDF to fetch record keys, similarly has to deserialize `InternalRow` into deserialized `Row`
>
--
This message was sent by Atlassian Jira
(v8.20.7#820007)