You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@hudi.apache.org by "Xianghu Wang (Jira)" <ji...@apache.org> on 2022/03/01 06:38:00 UTC

[jira] [Commented] (HUDI-3522) Introduce DropColumnSchemaPostProcessor to support drop columns from schema

    [ https://issues.apache.org/jira/browse/HUDI-3522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17499354#comment-17499354 ] 

Xianghu Wang commented on HUDI-3522:
------------------------------------

[~shivnarayan] thanks for your response.

there is a scene that data is read from kafka, schema read from hive, but schema from hive has more columns than kafka. 

Before transformer take action, the schema from hive is used to convert kafka data to Dataset, we need to make them compatible

HUDI-3523 is also this scene

> Introduce DropColumnSchemaPostProcessor to support drop columns from schema
> ---------------------------------------------------------------------------
>
>                 Key: HUDI-3522
>                 URL: https://issues.apache.org/jira/browse/HUDI-3522
>             Project: Apache Hudi
>          Issue Type: Task
>          Components: deltastreamer
>            Reporter: Xianghu Wang
>            Assignee: Xianghu Wang
>            Priority: Major
>             Fix For: 0.11.0
>
>
> A SchemaPostProcessor to drop columns from given schema



--
This message was sent by Atlassian Jira
(v8.20.1#820001)