You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Xianghu Wang (Jira)" <ji...@apache.org> on 2021/05/28 14:14:00 UTC

[jira] [Created] (HUDI-1946) Enhance SqlQueryBasedTransformer to allow user use wildcard to represent all the columns

Xianghu Wang created HUDI-1946:
----------------------------------

             Summary: Enhance SqlQueryBasedTransformer to allow user use wildcard to represent all the columns
                 Key: HUDI-1946
                 URL: https://issues.apache.org/jira/browse/HUDI-1946
             Project: Apache Hudi
          Issue Type: New Feature
            Reporter: Xianghu Wang
            Assignee: Xianghu Wang
             Fix For: 0.9.0


When the user wants to derive one or more columns from the existing columns and the
existing columns are all needed, the user needs to spell all the columns they need in the SQL.

This will be very troublesome and time-consuming when we have dozens or hundreds of columns. we can save trouble by using a wildcard in the SQL to represent all the columns.

that means if we have "id", "name", "age", "ts" these four columns already and we want to add a new column driver from ts, we can use

"select *, FROM_UNIXTIME(ts / 1000, 'yyyyMMdd') as dt from table_name"

to represent

"select id, name, age, ts, FROM_UNIXTIME(ts / 1000, 'yyyyMMdd') as dt from table_name"



--
This message was sent by Atlassian Jira
(v8.3.4#803005)